Together AI Launches DeepSWE: Open-Source RL Coding Agent Achieving 59% on SWEBench

Introduction to DeepSWE

Together AI has made waves with the release of DeepSWE, a fully open-source coding agent that utilizes reinforcement learning (RL) techniques. Built on the Qwen3-32B language model, DeepSWE has achieved a notable 59% accuracy on the SWEBench-Verified benchmark. This advancement indicates a significant shift for Together AI, moving towards autonomous language agents capable of continuous learning through real-world experiences.

Reinforcement Learning in Code Generation

DeepSWE’s development involved post-training the Qwen3-32B model using the rLLM framework from Agentica. Unlike traditional supervised methods that rely on fixed datasets, rLLM empowers agents to learn through real-world interactions. This approach is particularly effective for complex software engineering tasks, enabling the agent to improve continuously as it receives feedback.

Training Methodology

The backbone of DeepSWE’s training is the R2EGym dataset, a benchmark designed specifically for RL-based agent development in software engineering. This dataset focuses on practical, action-oriented objectives such as bug fixing, function completion, and code editing. As a result, DeepSWE learns to mirror the iterative nature of human software development, making it more adaptable and effective.

Performance Metrics

In terms of performance, DeepSWE stands out on the SWEBench-Verified benchmark. Scoring 59% with test-time scaling, it significantly outperforms previous models with open weights. The Pass@1 score, which assesses the likelihood of the agent solving a problem correctly on the first try, reaches an impressive 42.2%. These metrics underscore the potential of RL-based training, particularly in coding tasks that require precise and iterative reasoning.

Commitment to Open Source

Transparency is a cornerstone of DeepSWE’s release. Together AI and Agentica have provided not just the model itself, but also the entire training framework, including the rLLM architecture and the R2EGym dataset. This commitment to open-source development fosters reproducibility, allowing the research and development communities to build upon DeepSWE freely.

Accessing DeepSWE and Its Resources

Model Weights: Available on Hugging Face – DeepSWE
Training Framework: Visit the rLLM GitHub Repository
Training Documentation: Check out the DeepSWE Training Overview

Advancing from Language Reasoners to Language Agents

The development of DeepSWE signifies more than a technical upgrade; it reflects a philosophical shift in AI. Traditional large language models (LLMs) have excelled at reasoning but often fall short in adapting to new challenges. By leveraging reinforcement learning, DeepSWE can not only perform well upon release but also evolve as it encounters new tasks.

Potential Applications

DeepSWE’s modular and open-source nature allows for local deployment and customization. Developers can retrain the model for specific organizational needs, paving the way for diverse applications—from web navigation to robotics and autonomous research assistance.

Conclusion

In summary, DeepSWE represents a significant leap forward for generative AI in software engineering. By integrating reinforcement learning with the Qwen3-32B model and providing an open-source training infrastructure, Together AI is setting a new standard for coding agents. This evolution from language understanding to action-oriented agents has far-reaching implications for programming, automation, and intelligent system design.

FAQs

What is DeepSWE? DeepSWE is an open-source coding agent developed by Together AI, utilizing reinforcement learning to enhance software engineering tasks.
How does DeepSWE differ from traditional language models? Unlike traditional models, DeepSWE learns from real-world interactions and feedback, enabling continuous improvement.
What benchmarks has DeepSWE achieved? DeepSWE scored 59% accuracy on the SWEBench-Verified benchmark and a 42.2% Pass@1 score.
Where can I access DeepSWE? You can find DeepSWE’s model weights on Hugging Face and the training framework on the rLLM GitHub Repository.
What are some potential applications of DeepSWE? Applications include web navigation, robotics, and autonomous research assistance, tailored to organizational needs.

Unleash Your Creative Potential with AI Agents

Competitors are already using AI Agents

Business Problems We Solve

Automation of internal processes.
Optimizing AI costs without huge budgets.
Training staff, developing custom courses for business needs
Integrating AI into client work, automating first lines of contact

Large and Medium Businesses

Startups

Offline Business

Get a plan to reduce routine and improve metrics

100% of clients report increased productivity and reduced operati

AI Agents

Localization Project Manager – Coordinating translation workflows, answering vendor or process-related questions.

Job Title: Localization Project Manager Overview The Localization Project Manager plays a vital role in coordinating translation workflows while addressing vendor and process-related queries. This position is crucial for ensuring that translation projects are executed efficiently…
AI Agents

Environmental Health & Safety Officer – Answering compliance-related questions, retrieving safety protocols or audit histories.

Professional Summary The AI-driven Environmental Health & Safety Officer is a reliable and effective digital team member that performs repetitive and time-consuming tasks with remarkable speed, accuracy, and stability. By automating these tasks, it frees up…
AI Agents

Legal Contract Reviewer – Auto-flagging clause inconsistencies or retrieving precedent cases for review.

Job Title: Legal Contract Reviewer – Auto-flagging Clause Inconsistencies or Retrieving Precedent Cases for Review The AI functions as a reliable and effective digital team member that excels in performing repetitive and time-consuming tasks. With remarkable…
AI Agents

Customer Retention Analyst – Creating customer summaries, identifying churn risk patterns, and suggesting retention steps.

Customer Retention Analyst Professional Summary A highly analytical and detail-oriented Customer Retention Analyst with a proven track record in creating comprehensive customer summaries, identifying churn risk patterns, and suggesting effective retention strategies. Adept at leveraging data-driven…

Itinai.com httpss.mj.runmrqch2uvtvo russian handsome charisma 9fdbb2d5 a55b 425d 8f3b 76d26f86710f 2

AI Business Accelerator

Start Your AI Business in Just a Week with itinai.com

You’re a great fit if you:

Have an audience (even 500+ followers in Instagram, email, etc.)
Have an idea, service, or product you want to scale
Can invest 2–3 hours a day
You’re motivated to earn with AI but don’t want to handle technical setup

AI news and solutions

This AI Paper from China Introduces ‘Monkey’: A Novel Artificial Intelligence Approach to Enhance Input Resolution and Contextual Association in Large Multimodal Models

Large multimodal models like LLaVA, MiniGPT4, mPLUG-Owl, and Qwen-VL have made rapid progress in handling and analyzing various types of data. However, there are obstacles to overcome, such as dealing with complex scenarios and the need…

AI Tech News
Meet Torchchat: A Flexible Framework for Accelerating Llama 3, 3.1, and Other Large Language Models Across Laptop, Desktop, and Mobile

Meet Torchchat: A Flexible Framework for Accelerating Llama 3, 3.1, and Other Large Language Models Across Laptop, Desktop, and Mobile Practical Solutions and Value The rapid development of Large Language Models (LLMs) has significantly impacted various…

AI Tech News
Optimizing Large Language Models for Concise and Accurate Responses through Constrained Chain-of-Thought Prompting

Optimizing Large Language Models for Concise and Accurate Responses through Constrained Chain-of-Thought Prompting Practical Solutions and Value Recent advancements in Large Language Models (LLMs) have led to impressive abilities in handling complex question-answering tasks. However, challenges…

AI Tech News
Cloudera vs Hortonworks: Big Data AI That Supports Smarter Product Delivery

Technical Relevance In today’s data-driven landscape, organizations are increasingly relying on advanced analytics to drive decision-making and enhance profitability. Cloudera stands out as a leader in supporting large-scale data processing, particularly for applications such as fraud…

Tools
Breaking the Boundaries in 3D Scene Representation: How a New AI Technique is Changing the Game with Faster, More Efficient Rendering and Reduced Storage Demands

NeRF models scenes in 3D and learns from various viewpoints to create photorealistic images. Researchers from Sungkyunkwan University improved efficiency with a mask strategy, reducing memory requirements and increasing speed. Point-based rendering enhancements and ongoing research…

AI Tech News
Microsoft AI Research Introduces UFO: An Innovative UI-Focused Agent to Fulfill User Requests Tailored to Applications on Windows OS, Harnessing the Capabilities of GPT-Vision

Microsoft has introduced UFO, a UI-focused agent for Windows OS interaction. UFO uses natural language commands to address challenges in navigating the GUI of Windows applications. It employs a dual-agent framework and GPT-Vision to analyze and…

AI Tech News
deepset Unveils Studio Tool to Revolutionize AI Pipeline Development with Visual Architecting, Native Integrations to deepset Cloud, and NVIDIA AI Enterprise for Seamless Deployment

Revolutionize AI Pipeline Development with deepset Studio Empower Your Teams with Visual Architecting and Seamless Deployment deepset, a leader in mission-critical AI, introduces deepset Studio, an innovative tool designed to empower product, engineering, and data teams.…

AI Tech News
Learn AI for Free: 10 Best AI Courses to Take Right Now (2023)

Artificial intelligence (AI) is revolutionizing various industries and daily life. Learning about AI is essential for professionals in many fields, and luckily, there are free resources available online. This article presents the top five free AI…

AI Tech News
Advancing Sustainability Through Automation and AI in Fungi-Based Bioprocessing

Advancing Sustainability Through Automation and AI in Fungi-Based Bioprocessing Integrating automation and AI in fungi-based bioprocesses is a significant step towards sustainable biomanufacturing. This approach enhances process efficiency, reduces human error, and enables predictive analytics and…

AI Tech News
OpenAI Introduces CriticGPT: A New Artificial Intelligence AI Model based on GPT-4 to Catch Errors in ChatGPT’s Code Output

Practical Solutions and Value of CriticGPT in AI Assessment Enhancing AI Assessment with CriticGPT In the field of Artificial Intelligence (AI), it is essential to accurately evaluate model outputs. OpenAI has introduced CriticGPT, a tool designed…

AI Tech News
Goal Representations for Instruction Following

The text discusses the development of a model called Goal Representations for Instruction Following (GRIF), which allows robots to follow instructions and perform tasks. The model combines language and goal-conditioned training to improve performance. The text…

AI Tech News
BurstAttention: A Groundbreaking Machine Learning Framework that Transforms Efficiency in Large Language Models with Advanced Distributed Attention Mechanism for Extremely Long Sequences

Large language models have transformed language understanding and generation in machine learning. BurstAttention, a novel framework, addresses the challenge of processing long sequences by optimizing attention mechanisms, significantly reducing communication overhead and improving processing efficiency. It…

AI Tech News
Unlocking the Future: M3-Agent’s Multimodal Intelligence with Long-Term Memory

Understanding M3-Agent Imagine a future where a home robot can manage daily chores on its own, learning your habits and preferences over time. This is the promise of M3-Agent, a cutting-edge multimodal agent designed to enhance…

AI Tech News
Microsoft Unveils POML: Revolutionizing Prompt Engineering for AI Developers

In the rapidly evolving world of artificial intelligence, the introduction of the Prompt Orchestration Markup Language (POML) by Microsoft marks a significant advancement in how we interact with Large Language Models (LLMs). This open-source framework is…

AI Tech News
Nous Research Open-Sources Hermes 3: A Series of Instruct and Tool Use Model with Strong Reasoning and Creative Abilities

Enhancing AI Language Models for Practical Applications Addressing User Expectations Users expect AI systems to engage in complex conversations and understand context like humans. Challenges with Current Models Existing large language models (LLMs) struggle with tasks…

AI Tech News
Build an Autonomous Wet-Lab Protocol Planner with Salesforce CodeGen for Enhanced Experiment Safety and Efficiency

Building an Autonomous Wet-Lab Protocol Planner In the world of scientific research, efficiency and safety are paramount. This article explores how to create an intelligent agent that can streamline experimental design and execution in wet labs.…

AI Tech News
Migrating to Model Context Protocol (MCP): A Step-by-Step Guide for Developers and Architects

Understanding the Target Audience The target audience for this playbook includes architects, developers, and business managers involved in AI integrations. These professionals often face challenges such as: Difficulty managing and maintaining custom integrations High technical debt…

AI Tech News
Researchers from MIT and Harvard University Work on Enhancing AI Integrity: The Urgent Need for Standardized Data Provenance Frameworks

Practical Solutions for Enhancing AI Integrity Challenges in AI Data Collection Artificial intelligence relies on vast datasets from sources like social media and news outlets. However, the unstructured nature of this data poses challenges in maintaining…

AI Tech News
Intel Invests Heavily in Stability AI, Challenging OpenAI and ChatGPT

Intel Corporation has made a significant investment in Stability AI, a startup known for its Stable Diffusion software. This move positions Intel against OpenAI and its ChatGPT, marking a pivotal moment in the competitive AI market.…

AI Tech News
When Tackling Complex Topics, the First Step Is the Hardest

This text emphasizes the importance of continuous learning and growth in one’s career. It introduces several articles that cover various technical topics, such as generative AI, principle component analysis, image classification, linear algebra, support vector machines,…

AI Tech News