This AI Paper from CMU and Google DeepMind Studies the Role of Synthetic Data for Improving Math Reasoning Capabilities of LLMs

This AI Paper from CMU and Google DeepMind Studies the Role of Synthetic Data for Improving Math Reasoning Capabilities of LLMs

The Role of Synthetic Data in Improving LLMs’ Math Reasoning Capabilities

Research Findings:

Large language models (LLMs) face a challenge due to the scarcity of high-quality internet data. By 2026, researchers will need to rely on model-generated or synthetic data for training. This shift brings both opportunities and risks, impacting model performance and introducing biases. The challenge is to design high-quality synthetic data that addresses data scarcity without compromising model integrity.

Researchers have explored various approaches to tackle LLM training challenges using synthetic data. Efforts include generating positive synthetic data to mimic high-quality training data and using negative responses to unlearn problematic patterns in the training data.

A recent study by researchers from Carnegie Mellon University, Google DeepMind, and MultiOn reveals that positive synthetic data improves performance but with slower scaling rates than pretraining. Self-generated positive responses match the effectiveness of a larger amount of data, while incorporating negative synthetic data can scale efficiency up to eight times compared to using only positive data.

Proposed Method Architecture:

Synthetic Data Pipeline: Prompts capable models to generate new problems, obtains solution traces, and implements a binary reward function to verify correctness.

Dataset Construction: Creates positive synthetic dataset, generates positive and negative datasets using model-generated solutions.

Learning Algorithms: Includes Supervised Finetuning (SFT), Rejection Finetuning (RFT), and Preference Optimization using Direct Preference Optimization (DPO) with two variants: standard DPO and per-step DPO.

Conclusions and Recommendations:

The study emphasizes the importance of carefully constructing and utilizing both positive and negative synthetic data in LLM training for mathematical reasoning tasks. It suggests that incorporating negative (incorrect) traces can significantly enhance LLMs’ mathematical reasoning abilities.

AI Solutions for Business Transformation:

AI can redefine your way of work by identifying automation opportunities, defining measurable KPIs, selecting appropriate AI solutions, and implementing AI usage gradually.

For AI KPI management advice and insights into leveraging AI, connect on Telegram or Twitter.

List of Useful Links:

AI Products for Business or Try Custom Development

AI Sales Bot

Welcome AI Sales Bot, your 24/7 teammate! Engaging customers in natural language across all channels and learning from your materials, it’s a step towards efficient, enriched customer interactions and sales

AI Document Assistant

Unlock insights and drive decisions with our AI Insights Suite. Indexing your documents and data, it provides smart, AI-driven decision support, enhancing your productivity and decision-making.

AI Customer Support

Upgrade your support with our AI Assistant, reducing response times and personalizing interactions by analyzing documents and past engagements. Boost your team and customer satisfaction

AI Scrum Bot

Enhance agile management with our AI Scrum Bot, it helps to organize retrospectives. It answers queries and boosts collaboration and efficiency in your scrum processes.