Understanding Text Embedding in AI
Text embedding is a core technique in natural language processing (NLP): it maps words, phrases, and longer passages to numerical vectors that capture their meaning. With text represented as vectors, machines can handle tasks such as classification, clustering, retrieval, and summarization, which in turn powers applications like sentiment analysis and recommendation systems.
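To make the idea concrete, here is a minimal sketch of embedding a few sentences with an open-source model. It assumes the sentence-transformers library, and the model name all-MiniLM-L6-v2 is only an illustrative choice, not one taken from the paper.

```python
# Minimal illustration: turning short texts into vectors with an
# open-source sentence-transformer (model choice is an example only).
from sentence_transformers import SentenceTransformer
import numpy as np

model = SentenceTransformer("all-MiniLM-L6-v2")

texts = ["The movie was fantastic", "I loved this film", "The weather is cold"]
vectors = model.encode(texts)  # numpy array, shape (3, 384) for this model

def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Semantically similar sentences land close together in vector space.
print(cosine(vectors[0], vectors[1]))  # relatively high similarity
print(cosine(vectors[0], vectors[2]))  # noticeably lower similarity
```

Once text is in this vector form, nearest-neighbor search over the vectors is what drives retrieval, clustering, and recommendation use cases.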
The Challenge of Training Data
A major obstacle in text embedding is the need for large amounts of high-quality training data, and labeling that data manually is both costly and time-consuming. Generating synthetic data is a potential fix, but many existing techniques depend on expensive proprietary models such as GPT-4, which limits access to advanced embedding technology for many researchers.
Current Methods and Their Limitations
Many current methods prompt large language models (LLMs) to generate synthetic training text. A common recipe has GPT-4 produce triplets, each pairing a query with a relevant (positive) passage and an irrelevant (negative) one, to cover a diverse range of tasks. However, this process is costly and complex, making it hard for researchers to tailor it to their needs. There is a clear need for more accessible, cost-effective alternatives.
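To make the triplet format concrete, here is a minimal sketch of what one such synthetic example might look like. The field names and sample text are illustrative assumptions, not the exact schema used by these pipelines.

```python
# Sketch of a synthetic embedding-training triplet, as commonly produced by
# LLM-based data generators. Field names and content are illustrative only.
from dataclasses import dataclass

@dataclass
class Triplet:
    query: str     # a search query or task input
    positive: str  # a passage that answers or matches the query
    negative: str  # a plausible-looking but non-matching passage

example = Triplet(
    query="What causes tides on Earth?",
    positive="Tides are caused mainly by the gravitational pull of the Moon "
             "and the Sun on Earth's oceans.",
    negative="Ocean salinity varies by region and depends on evaporation "
             "and freshwater inflow.",
)

# Embedding models are then trained (typically with a contrastive objective)
# to place the query closer to the positive passage than to the negative one.
```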
Introducing SPEED: A New Framework
Researchers from the Gaoling School of Artificial Intelligence and Microsoft have developed SPEED, a framework that uses small, open-source models to create high-quality embedding data with much lower resource needs. This innovative approach aims to make synthetic data generation more accessible.
How SPEED Works
SPEED consists of three main components:
- Junior Generator: Produces initial low-cost synthetic data based on task descriptions.
- Senior Generator: Enhances data quality using preference optimization.
- Data Revisor: Refines outputs to improve quality and consistency.
Chaining these stages lets SPEED efficiently align small, open-source models to data-generation work that is typically handled by much larger models.
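The sketch below is one hypothetical way to express that three-stage flow in code. The function names, signatures, and wiring are placeholders invented for illustration; they do not come from the SPEED paper or its released code.

```python
# Hypothetical pseudocode for the three-stage flow described above.
# Each stub stands in for a call to the corresponding model; none of these
# names or signatures are taken from SPEED's actual implementation.

def junior_generate(task_description: str, n: int) -> list[dict]:
    """A small open-source model drafts n low-cost synthetic examples."""
    ...  # placeholder for the junior generator call

def senior_generate(task_description: str, drafts: list[dict]) -> list[dict]:
    """A preference-optimized generator produces higher-quality examples."""
    ...  # placeholder for the senior generator call

def revise(examples: list[dict]) -> list[dict]:
    """The data revisor polishes examples for quality and consistency."""
    ...  # placeholder for the revision step

def build_synthetic_dataset(task_description: str, n: int = 1000) -> list[dict]:
    drafts = junior_generate(task_description, n)
    improved = senior_generate(task_description, drafts)
    return revise(improved)

# Example usage (once the stubs are wired up to real models):
# dataset = build_synthetic_dataset(
#     "Generate retrieval triplets (query, positive, negative) for web search.")
```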
Results and Benefits of SPEED
SPEED has shown notable gains in both embedding quality and cost-effectiveness. It outperformed the leading model, E5mistral, while using only 45,000 API calls compared to E5mistral's 500,000, a cost reduction of over 90% (1 - 45,000/500,000 ≈ 91%). On the Massive Text Embedding Benchmark (MTEB), SPEED performed strongly across a wide range of tasks, demonstrating its versatility and effectiveness.
Practical Solutions and Value of SPEED
SPEED provides a practical, low-cost solution for the NLP community. It allows researchers to generate high-quality data for training embedding models without relying on expensive proprietary technologies. This framework showcases how small, open-source models can efficiently meet the demands of synthetic data generation, fostering broader access to advanced NLP tools.
Get Involved and Stay Updated
Check out the Paper for more details. Follow us on Twitter, join our Telegram Channel, and connect on LinkedIn. If you appreciate our work, subscribe to our newsletter and join our 55k+ ML SubReddit.
Enhance Your Business with AI
To evolve your company with AI, consider the following steps:
- Identify Automation Opportunities: Find key customer interaction points that can benefit from AI.
- Define KPIs: Ensure measurable impacts on business outcomes.
- Select an AI Solution: Choose tools that fit your needs and allow customization.
- Implement Gradually: Start with a pilot project, gather data, and expand wisely.
For AI KPI management advice, contact us at hello@itinai.com. For ongoing insights into leveraging AI, follow us on Telegram or Twitter.