
IBM and ETH Zürich Develop Analog Foundation Models to Enhance In-Memory AI Hardware Performance

Overview of Analog Foundation Models

IBM researchers, in collaboration with ETH Zürich, have introduced a new class of Analog Foundation Models (AFMs) aimed at addressing the noise issues inherent in Analog In-Memory Computing (AIMC) hardware. AIMC has the potential to significantly enhance efficiency by enabling the execution of models with a billion parameters in a compact footprint suitable for embedded or edge devices. However, noise has been a critical barrier, as matrix-vector multiplications performed directly within non-volatile memory (NVM) devices often result in non-deterministic errors that hinder the performance of existing models.

The Importance of Analog Computing for LLMs

Unlike traditional computing methods using GPUs or TPUs, AIMC performs matrix-vector multiplications directly within memory arrays, eliminating the von Neumann bottleneck and significantly improving throughput and power efficiency. Previous studies indicated that combining AIMC with 3D NVM and Mixture-of-Experts (MoE) architectures could theoretically support trillion-parameter models on compact accelerators, making large-scale AI feasible beyond data centers.
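
The mechanics can be illustrated with a toy numerical model (an illustration of the principle, not IBM's hardware): the weight matrix is stored in the memory array as conductances, the input vector is applied as voltages, and the column read-outs yield the full matrix-vector product in one step, with no weight traffic between memory and a separate compute unit.

```python
import numpy as np

# Toy model of an analog in-memory matrix-vector multiply (MVM).
# The weights live in the crossbar as conductances; applying the input as
# voltages produces column currents proportional to W @ x in a single step,
# so no weights are streamed to a separate compute unit.
rng = np.random.default_rng(0)

W = rng.normal(size=(256, 512))            # layer weights
g_max = np.abs(W).max()
G = W / g_max                              # weights mapped to normalized conductances
x = rng.normal(size=512)                   # activations applied as input voltages

y_analog = g_max * (G @ x)                 # column read-out, rescaled to weight units
y_digital = W @ x                          # conventional digital reference

print(np.allclose(y_analog, y_digital))    # True in this idealized, noise-free model
```

Real devices perturb exactly this read-out step with noise, which is the problem the rest of the article addresses.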

Challenges in Implementing AIMC

The primary challenge in utilizing AIMC is the presence of noise. AIMC computations are affected by device variability, DAC/ADC quantization, and runtime fluctuations, which can degrade model accuracy. Unlike quantization on GPUs, where errors are predictable, analog noise is stochastic and unpredictable. While earlier research adapted smaller networks like CNNs and RNNs (less than 100M parameters) to tolerate such noise, LLMs with billions of parameters have struggled under AIMC constraints.
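
The distinction can be made concrete with a short sketch (the noise magnitude here is an arbitrary illustration, not measured device data): a round-to-nearest quantized weight matrix produces exactly the same output on every call, whereas an analog read-out perturbed by fresh noise gives a different result each time.

```python
import numpy as np

rng = np.random.default_rng(1)
W = rng.normal(size=(64, 64))
x = rng.normal(size=64)

# Deterministic 4-bit round-to-nearest quantization: the error is fixed and repeatable.
scale = np.abs(W).max() / 7                        # symmetric INT4 range [-7, 7]
W_q = np.clip(np.round(W / scale), -7, 7) * scale
print(np.array_equal(W_q @ x, W_q @ x))            # True: identical output on every call

# Stochastic analog read-out: every MVM sees fresh device and runtime noise.
def analog_mvm(W, x, noise_std=0.02):
    W_noisy = W + rng.normal(scale=noise_std * np.abs(W).max(), size=W.shape)
    return W_noisy @ x

print(np.allclose(analog_mvm(W, x), analog_mvm(W, x)))  # False: outputs differ per call
```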

Addressing Noise with Analog Foundation Models

The IBM team has developed AFMs that incorporate hardware-aware training to prepare LLMs for analog execution. Their training pipeline includes the following techniques, sketched in code after the list:

  • Noise injection during training to simulate AIMC randomness.
  • Iterative weight clipping to stabilize distributions within device limits.
  • Learned static input/output quantization ranges aligned with real hardware constraints.
  • Distillation from pre-trained LLMs using 20B tokens of synthetic data.
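
A minimal PyTorch-style sketch of the first three ingredients follows (distillation is omitted). The module name, noise level, clipping range, and quantization scheme are illustrative assumptions rather than the AFM training code.

```python
import torch
import torch.nn as nn

class NoisyClippedLinear(nn.Module):
    """Illustrative hardware-aware linear layer, not IBM's AFM implementation:
    it injects weight noise each forward pass, clips weights to a device-like
    range, and applies learned static input/output quantization ranges."""

    def __init__(self, in_f, out_f, noise_std=0.02, w_clip=2.0, bits=8):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_f, in_f) * 0.02)
        self.in_range = nn.Parameter(torch.tensor(3.0))    # learned static input (DAC) range
        self.out_range = nn.Parameter(torch.tensor(10.0))  # learned static output (ADC) range
        self.noise_std, self.w_clip = noise_std, w_clip
        self.levels = 2 ** (bits - 1) - 1

    def _fake_quant(self, t, range_param):
        # Static-range fake quantization; the rounding step uses a straight-through
        # estimator, and gradients reach the learned range through the clipping
        # operation (a simplification of the paper's scheme).
        r = range_param.abs() + 1e-8
        t_c = torch.minimum(torch.maximum(t, -r), r)
        scale = r / self.levels
        q = torch.round(t_c / scale) * scale
        return t_c + (q - t_c).detach()

    def forward(self, x):
        # Iterative weight clipping: keep weights inside the representable device range.
        with torch.no_grad():
            self.weight.clamp_(-self.w_clip, self.w_clip)
        # Noise injection: simulate AIMC randomness while training.
        w = self.weight
        if self.training:
            w = w + self.noise_std * w.abs().max() * torch.randn_like(w)
        x_q = self._fake_quant(x, self.in_range)               # input quantization
        return self._fake_quant(x_q @ w.t(), self.out_range)   # output quantization
```

In a full pipeline, layers like this would stand in for the model's linear layers, with the distillation loss against the pre-trained teacher providing the training signal on the synthetic corpus.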

These methods, implemented with AIHWKIT-Lightning, enable models such as Phi-3-mini-4k-instruct and Llama-3.2-1B-Instruct to maintain performance comparable to 4-bit weight / 8-bit activation quantized baselines under analog noise. Evaluations across reasoning and factual benchmarks indicate that AFMs outperform both quantization-aware training (QAT) and post-training quantization methods such as SpinQuant.

Compatibility with Digital Hardware

Interestingly, AFMs also demonstrate strong performance on low-precision digital hardware. Because AFMs are trained to withstand noise and clipping, they manage simple post-training round-to-nearest (RTN) quantization more effectively than existing methods. This adaptability makes them valuable not only for AIMC accelerators but also for standard digital inference hardware.
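
For reference, round-to-nearest quantization is the simplest post-training scheme; the sketch below shows a per-tensor symmetric INT4 variant (an illustrative helper, not the paper's evaluation code, and real setups often use per-channel or per-group scales).

```python
import torch

def rtn_quantize(weight: torch.Tensor, bits: int = 4) -> torch.Tensor:
    """Per-tensor symmetric round-to-nearest (RTN) quantization."""
    levels = 2 ** (bits - 1) - 1            # 7 for INT4
    scale = weight.abs().max() / levels     # a single scale for the whole tensor
    q = torch.clamp(torch.round(weight / scale), -levels, levels)
    return q * scale                        # dequantized weights used at inference

w = torch.randn(4096, 4096)
w4 = rtn_quantize(w, bits=4)
print((w - w4).abs().mean())                # mean round-to-nearest error
```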

Scalability of Performance

Performance can also scale with additional compute at inference time. The researchers tested compute scaling on the MATH-500 benchmark, generating multiple answers per query and selecting the best one with a reward model. AFMs exhibited better scaling behavior than QAT models, with accuracy gaps shrinking as more inference compute was allocated. This aligns with AIMC’s strengths in low-power, high-throughput inference rather than training.
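
This test-time scaling setup is essentially best-of-N sampling with a reward model; the sketch below is schematic, with `generate` and `score` as placeholders for the AFM's sampling call and the reward model rather than the paper's actual harness.

```python
from typing import Callable, List

def best_of_n(prompt: str,
              generate: Callable[[str], str],
              score: Callable[[str, str], float],
              n: int = 8) -> str:
    """Best-of-N test-time scaling: draw n candidate answers and keep the one
    the reward model ranks highest. More inference compute means a larger n."""
    candidates: List[str] = [generate(prompt) for _ in range(n)]
    return max(candidates, key=lambda answer: score(prompt, answer))
```

Under this scheme, accuracy becomes a function of n, so spending more inference compute directly buys back some of the accuracy lost to analog noise.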

Future Implications for AIMC

This research represents the first systematic demonstration that large LLMs can be adapted to AIMC hardware without significant accuracy loss. While training AFMs is resource-intensive and reasoning tasks like GSM8K still reveal accuracy gaps, the findings mark a significant milestone. The combination of energy efficiency, robustness to noise, and compatibility with digital hardware positions AFMs as a promising avenue for scaling foundation models beyond the limitations of GPU technology.

Conclusion

In summary, the introduction of Analog Foundation Models by IBM and ETH Zürich offers a groundbreaking approach to overcoming the challenges posed by noise in analog computing. By enhancing the performance of large language models on compact hardware, these innovations pave the way for more efficient AI solutions in various applications. As the technology matures, it holds the potential to transform how we approach AI, making it more accessible and effective across different platforms.

FAQ

  • What are Analog Foundation Models? Analog Foundation Models are a new class of AI models designed to operate efficiently in Analog In-Memory Computing environments, addressing noise issues that affect model accuracy.
  • How do AFMs improve model performance? AFMs utilize hardware-aware training techniques that prepare models for the unique challenges of analog execution, such as noise and variability.
  • What are the advantages of AIMC over traditional computing methods? AIMC eliminates the von Neumann bottleneck, improving throughput and power efficiency, making it suitable for large-scale AI applications.
  • Can AFMs be used with digital hardware? Yes, AFMs are compatible with low-precision digital hardware and can perform effectively even under standard digital inference conditions.
  • What future implications do AFMs have for AI technology? AFMs represent a significant advancement in scaling AI models, potentially leading to more energy-efficient and robust AI solutions that can operate beyond the limitations of current GPU technology.

Vladimir Dyachkov, Ph.D
Editor-in-Chief itinai.com

I believe that AI is only as powerful as the human insight guiding it.
