Salesforce AI Launches SWERank: Cost-Effective Solution for Software Issue Localization

SWERank: A New Approach to Software Issue Localization

Software issue localization, the task of finding where in a codebase a bug fix or feature request must be applied, is one of the most challenging parts of software development. Despite advances in automated tools, locating the code that needs to change often takes longer than making the change itself. Existing automated approaches can be slow and costly, especially when they rely on closed-source models. To address these challenges, Salesforce AI has developed SWERank, a framework that offers a more efficient and precise way to localize software issues.

Understanding SWERank

SWERank is a lightweight framework that improves the process of software issue localization by treating it as a code ranking task. It consists of two main components:

SWERankEmbed: A bi-encoder model that efficiently retrieves relevant code snippets by encoding GitHub issues and code into a shared space.
SWERankLLM: A listwise reranker that refines the retrieval results using contextual understanding from large language models (LLMs).

Data-Driven Approach

To train SWERank, the research team created a dataset called SWELOC, which links real-world issue reports with the corresponding code changes from public GitHub repositories. This dataset enhances the model’s accuracy by providing high-quality training examples.
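
As a rough illustration of what a dataset linking issue reports to code changes might contain, the sketch below models one record and turns it into (issue, changed function) training pairs. The class name, field names, and sample values are hypothetical and are not taken from SWELOC itself.

```python
from dataclasses import dataclass, field


@dataclass
class IssueExample:
    """One hypothetical record: an issue report plus the functions its fix changed."""
    repo: str
    issue_text: str
    # Maps function name -> function source code (illustrative layout only).
    changed_functions: dict[str, str] = field(default_factory=dict)


def to_training_pairs(example: IssueExample) -> list[tuple[str, str]]:
    """Pair the issue text with the source of each changed function."""
    return [(example.issue_text, src) for src in example.changed_functions.values()]


example = IssueExample(
    repo="example-org/example-repo",  # made-up repository name
    issue_text="mean() raises ZeroDivisionError on empty input",
    changed_functions={"mean": "def mean(xs): return sum(xs) / len(xs) if xs else 0.0"},
)
pairs = to_training_pairs(example)
print(pairs)
```

Each pair treats the issue as a query and the changed function as its relevant document, which is the shape a ranking model would need for training.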

How SWERank Works

SWERank operates in two stages:

Retrieval: SWERankEmbed converts issue descriptions and candidate functions into dense vector representations, allowing for efficient similarity-based retrieval.
Reranking: SWERankLLM processes the issue description and the top retrieved code candidates to generate a ranked list, ensuring that the most relevant code is prioritized.
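
The two stages above can be sketched as a small retrieve-then-rerank pipeline. This is a toy illustration only: the bag-of-words `embed` function stands in for a learned bi-encoder, and the name-overlap `rerank` heuristic stands in for an LLM reranker. All function names and sample data are assumptions, not SWERank's actual code or API.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for a bi-encoder: a bag-of-words term-count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(issue: str, functions: dict[str, str], k: int) -> list[str]:
    """Stage 1: score every candidate function against the issue, keep the top k."""
    query = embed(issue)
    ranked = sorted(functions, key=lambda n: cosine(query, embed(functions[n])), reverse=True)
    return ranked[:k]


def rerank(issue: str, candidates: list[str]) -> list[str]:
    """Stage 2 placeholder: reorder candidates by issue/function-name token overlap.

    A real listwise reranker would prompt an LLM with the issue and the
    candidate code; this heuristic only marks where that step would go.
    """
    issue_tokens = set(embed(issue))
    return sorted(candidates, key=lambda n: len(issue_tokens & set(embed(n))), reverse=True)


functions = {
    "parse_date": "def parse_date(s): return datetime.strptime(s, '%Y-%m-%d')",
    "format_date": "def format_date(d): return d.strftime('%Y-%m-%d')",
    "send_email": "def send_email(to, body): smtp.send(to, body)",
}
issue = "parse_date crashes when the date string has a timezone"

top_k = retrieve(issue, functions, k=2)
ranked = rerank(issue, top_k)
print(ranked)
```

The point of the split is that the cheap first stage narrows the whole codebase down to a handful of candidates, so the more expensive second stage only has to read those.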

Performance Insights

SWERank has demonstrated impressive results in evaluations against standard benchmarks. For instance, SWERankEmbed-Large achieved a function-level accuracy of 82.12%, surpassing other models. When combined with SWERankLLM-Large, the accuracy improved to 88.69%, setting a new standard in the field.
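
The article does not spell out how function-level accuracy is computed. One common convention for localization, assumed here, counts an issue as correct when all of its ground-truth functions appear among the top-k ranked candidates. The function name and sample data below are illustrative.

```python
def accuracy_at_k(rankings: list[list[str]], gold: list[set[str]], k: int) -> float:
    """Fraction of issues whose gold functions all appear in the top-k ranking."""
    hits = sum(1 for ranked, g in zip(rankings, gold) if g <= set(ranked[:k]))
    return hits / len(rankings)


# Two made-up issues, each with a ranked candidate list and one gold function.
rankings = [["a", "b", "c"], ["x", "y", "z"]]
gold = [{"b"}, {"z"}]

acc_at_2 = accuracy_at_k(rankings, gold, k=2)
acc_at_3 = accuracy_at_k(rankings, gold, k=3)
print(acc_at_2, acc_at_3)
```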

Cost Efficiency

Beyond accuracy, SWERank is significantly cheaper to run than competing approaches. While other models may cost around $0.66 per example, SWERankLLM costs just $0.011 to $0.015 per example, delivering up to six times better accuracy per dollar.
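
The per-example prices quoted above imply a raw cost ratio that is easy to check. The short calculation below uses only those figures. It compares cost alone and does not attempt to reproduce the accuracy-relative-to-cost claim.

```python
baseline_cost = 0.66            # dollars per example, other models (from the article)
swerank_costs = (0.011, 0.015)  # dollars per example, SWERankLLM range (from the article)

# How many times cheaper SWERankLLM is at each end of its quoted range.
ratios = [baseline_cost / cost for cost in swerank_costs]
for cost, ratio in zip(swerank_costs, ratios):
    print(f"${cost:.3f} per example is about {ratio:.0f}x cheaper than ${baseline_cost:.2f}")
```

On these numbers, SWERankLLM is roughly 44 to 60 times cheaper per example than the $0.66 baseline.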

Conclusion

SWERank represents a significant advancement in software issue localization by transforming it into a ranking problem. With its efficient architecture and high-quality training data, SWERank not only achieves state-of-the-art accuracy but also reduces costs and latency. This framework illustrates that practical and scalable solutions for debugging and code maintenance are achievable using open-source tools. By focusing on efficient neural retrieval, Salesforce AI has set a new benchmark for accuracy and efficiency in automated software engineering.

For more information, check out the SWERank project page.

