Itinai.com its now possible to take control of your website i 65053d84 9f33 4cad 8a6a 250603ea0656 2
Itinai.com its now possible to take control of your website i 65053d84 9f33 4cad 8a6a 250603ea0656 2

Together AI Launches DeepSWE: Open-Source RL Coding Agent Achieving 59% on SWEBench

Introduction to DeepSWE

Together AI has made waves with the release of DeepSWE, a fully open-source coding agent that utilizes reinforcement learning (RL) techniques. Built on the Qwen3-32B language model, DeepSWE has achieved a notable 59% accuracy on the SWEBench-Verified benchmark. This advancement indicates a significant shift for Together AI, moving towards autonomous language agents capable of continuous learning through real-world experiences.

Reinforcement Learning in Code Generation

DeepSWE’s development involved post-training the Qwen3-32B model using the rLLM framework from Agentica. Unlike traditional supervised methods that rely on fixed datasets, rLLM empowers agents to learn through real-world interactions. This approach is particularly effective for complex software engineering tasks, enabling the agent to improve continuously as it receives feedback.

Training Methodology

The backbone of DeepSWE’s training is the R2EGym dataset, a benchmark designed specifically for RL-based agent development in software engineering. This dataset focuses on practical, action-oriented objectives such as bug fixing, function completion, and code editing. As a result, DeepSWE learns to mirror the iterative nature of human software development, making it more adaptable and effective.

Performance Metrics

In terms of performance, DeepSWE stands out on the SWEBench-Verified benchmark. Scoring 59% with test-time scaling, it significantly outperforms previous models with open weights. The Pass@1 score, which assesses the likelihood of the agent solving a problem correctly on the first try, reaches an impressive 42.2%. These metrics underscore the potential of RL-based training, particularly in coding tasks that require precise and iterative reasoning.

Commitment to Open Source

Transparency is a cornerstone of DeepSWE’s release. Together AI and Agentica have provided not just the model itself, but also the entire training framework, including the rLLM architecture and the R2EGym dataset. This commitment to open-source development fosters reproducibility, allowing the research and development communities to build upon DeepSWE freely.

Accessing DeepSWE and Its Resources

Advancing from Language Reasoners to Language Agents

The development of DeepSWE signifies more than a technical upgrade; it reflects a philosophical shift in AI. Traditional large language models (LLMs) have excelled at reasoning but often fall short in adapting to new challenges. By leveraging reinforcement learning, DeepSWE can not only perform well upon release but also evolve as it encounters new tasks.

Potential Applications

DeepSWE’s modular and open-source nature allows for local deployment and customization. Developers can retrain the model for specific organizational needs, paving the way for diverse applications—from web navigation to robotics and autonomous research assistance.

Conclusion

In summary, DeepSWE represents a significant leap forward for generative AI in software engineering. By integrating reinforcement learning with the Qwen3-32B model and providing an open-source training infrastructure, Together AI is setting a new standard for coding agents. This evolution from language understanding to action-oriented agents has far-reaching implications for programming, automation, and intelligent system design.

FAQs

  • What is DeepSWE? DeepSWE is an open-source coding agent developed by Together AI, utilizing reinforcement learning to enhance software engineering tasks.
  • How does DeepSWE differ from traditional language models? Unlike traditional models, DeepSWE learns from real-world interactions and feedback, enabling continuous improvement.
  • What benchmarks has DeepSWE achieved? DeepSWE scored 59% accuracy on the SWEBench-Verified benchmark and a 42.2% Pass@1 score.
  • Where can I access DeepSWE? You can find DeepSWE’s model weights on Hugging Face and the training framework on the rLLM GitHub Repository.
  • What are some potential applications of DeepSWE? Applications include web navigation, robotics, and autonomous research assistance, tailored to organizational needs.
Itinai.com office ai background high tech quantum computing 0002ba7c e3d6 4fd7 abd6 cfe4e5f08aeb 0

Vladimir Dyachkov, Ph.D
Editor-in-Chief itinai.com

I believe that AI is only as powerful as the human insight guiding it.

Unleash Your Creative Potential with AI Agents

Competitors are already using AI Agents

Business Problems We Solve

  • Automation of internal processes.
  • Optimizing AI costs without huge budgets.
  • Training staff, developing custom courses for business needs
  • Integrating AI into client work, automating first lines of contact

Large and Medium Businesses

Startups

Offline Business

100% of clients report increased productivity and reduced operati

AI news and solutions