Researchers from the University of Washington and Google have developed a technique called “Distilling Step-by-Step” for training small machine learning models with less data. The approach extracts informative natural language rationales from large language models and uses them as additional supervision during training. The method showed significant performance gains with reduced data requirements, making the capabilities of advanced language models more accessible for a range of applications.
Review: Distilling Step-by-Step Technology for Training Small Machine Learning Models
In recent years, large language models (LLMs) have revolutionized the field of natural language processing, enabling unprecedented zero-shot and few-shot learning capabilities. However, their deployment in real-world applications has been hindered by their immense computational demands. Serving a single 175-billion-parameter LLM requires a staggering 350GB of GPU memory and specialized infrastructure. With today’s state-of-the-art models exceeding 500 billion parameters, these requirements put LLMs out of reach for many research teams, particularly those building applications with low-latency requirements.
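The 350GB figure follows from simple arithmetic: 175 billion parameters stored at 16-bit (2-byte) precision, before accounting for activations or serving overhead. A quick back-of-the-envelope check:

```python
# Back-of-the-envelope check of the memory figure cited above.
# Assumes weights are held in half precision (fp16/bf16, 2 bytes each);
# activations, KV caches, and serving overhead would add to this.
params = 175e9          # 175 billion parameters
bytes_per_param = 2     # 16-bit precision
gpu_memory_gb = params * bytes_per_param / 1e9
print(gpu_memory_gb)    # 350.0
```

This is why such models typically require sharding across multiple high-memory accelerators rather than fitting on any single device.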
To address this deployment challenge, researchers have turned to smaller specialized models, trained through either fine-tuning or distillation. Fine-tuning, while effective, relies on costly and time-consuming human-generated labels. Distillation, on the other hand, demands copious amounts of unlabeled data, which can be difficult to obtain.
In a study by a research team from Google and the University of Washington presented at ACL 2023, the authors introduced “Distilling Step-by-Step,” a novel mechanism designed to mitigate the trade-off between model size and the cost of data collection. The approach hinges on extracting informative natural language rationales, or intermediate reasoning steps, from LLMs. These rationales then serve as additional, richer supervision when training smaller task-specific models alongside standard task labels.
The researchers outline a two-stage process for implementing Distilling Step-by-Step. First, they employ chain-of-thought (CoT) prompting to extract rationales from an LLM, enabling the model to generate rationales for unseen inputs. Second, these rationales are integrated into the training of small models via a multi-task learning framework, with task prefixes guiding the model to distinguish between label prediction and rationale generation.
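The multi-task setup above can be sketched in a few lines. This is a minimal illustration, not the authors' code: the prefix strings, field names, and the `lam` weight are assumptions chosen for clarity, and the loss combination stands in for whatever weighting the paper actually uses.

```python
# Sketch of the multi-task training setup described above.
# Prefix strings, field names, and the lambda weight are illustrative
# assumptions, not taken from the paper's implementation.

def make_training_examples(x, label, rationale):
    """Expand one annotated input into the two multi-task examples:
    a label-prediction example and a rationale-generation example,
    distinguished by a task prefix on the input text."""
    return [
        {"input": f"[label] {x}", "target": label},
        {"input": f"[rationale] {x}", "target": rationale},
    ]

def combined_loss(label_loss, rationale_loss, lam=1.0):
    """Multi-task objective: the standard label-prediction loss plus
    a weighted rationale-generation loss."""
    return label_loss + lam * rationale_loss

# Example: an NLI-style input with an LLM-generated rationale.
examples = make_training_examples(
    x="premise: A dog runs. hypothesis: An animal moves.",
    label="entailment",
    rationale="A dog is an animal and running is a form of moving.",
)
```

At inference time only the label-prediction prefix is used, so rationale generation adds no serving cost; the rationales act purely as extra training signal.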
In a series of experiments, the researchers used a 540B-parameter LLM to generate rationales and T5 models as the smaller task-specific models. Distilling Step-by-Step exhibited remarkable performance gains with significantly reduced data requirements. For instance, on the e-SNLI dataset, the method outperformed standard fine-tuning with just 12.5% of the full dataset. Similar reductions in dataset size were observed across various NLP tasks, including ANLI, CQA, and SVAMP.
Furthermore, Distilling Step-by-Step achieved superior performance using considerably smaller model sizes compared to few-shot CoT-prompted LLMs. For instance, on the e-SNLI dataset, a 220M T5 model surpassed the performance of a 540B PaLM. On ANLI, a 770M T5 model, over 700 times smaller, outperformed the 540B PaLM, demonstrating the immense potential for efficiency gains.
Notably, these gains compound: Distilling Step-by-Step can beat few-shot LLMs with both a smaller model and less data. On ANLI, for instance, a 770M T5 model surpassed the 540B PaLM while using only 80% of the full dataset, a result unattainable through standard fine-tuning.
In conclusion, Distilling Step-by-Step presents a compelling paradigm for training small, task-specific models. By extracting rationales from LLMs, the approach both reduces the data required for training and enables the use of significantly smaller models, making the capabilities of advanced language models practical for a broader range of applications.