NVIDIA AI Releases Star Elastic: One Checkpoint that Contains 30B, 23B, and 12B Reasoning Models with Zero-Shot Slicing
NVIDIA researchers have introduced Star Elastic, a post-training method that embeds multiple nested reasoning models (at 30B, 23B, and 12B parameter scales) inside a single checkpoint from a single training run. Applied to Nemotron Nano v3 (a hybrid Mamba–Transformer–MoE model with 30B total parameters and 3.6B active parameters), Star Elastic produces nested 23B (2.8B active) and 12B (2.0B active) variants trained on approximately 160B tokens. All three variants coexist in one checkpoint and can be extracted without any additional fine-tuning, eliminating the need to train or store separate model variants.
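Zero-shot slicing can be pictured as reading a smaller variant directly out of the shared checkpoint by keeping the highest-ranked leading slice of each elastic dimension, with no further training. The sketch below is only illustrative: the tensor names, component counts, and checkpoint layout are hypothetical, not the actual Nemotron Nano v3 format.

```python
import torch

# Illustrative per-budget component counts (hypothetical numbers).
# Because components are already sorted by importance, a smaller budget
# simply keeps the leading slice of each elastic dimension.
BUDGET_SPECS = {
    "30B": {"ffn_channels": 8192, "experts": 64},
    "23B": {"ffn_channels": 6144, "experts": 48},
    "12B": {"ffn_channels": 3584, "experts": 28},
}

def slice_checkpoint(full_state: dict, budget: str) -> dict:
    """Extract a nested submodel from the single elastic checkpoint.

    Assumes weights were permuted so the most important channels/experts
    occupy the lowest indices (nested weight-sharing). Attention heads,
    Mamba SSM heads, and embedding channels would be sliced analogously.
    """
    spec = BUDGET_SPECS[budget]
    sliced = {}
    for name, tensor in full_state.items():
        if "ffn.up_proj" in name:        # shape [ffn_channels, hidden]
            sliced[name] = tensor[: spec["ffn_channels"]].clone()
        elif "ffn.down_proj" in name:    # shape [hidden, ffn_channels]
            sliced[name] = tensor[:, : spec["ffn_channels"]].clone()
        elif "experts" in name:          # shape [num_experts, ...]
            sliced[name] = tensor[: spec["experts"]].clone()
        else:                            # shared tensors are kept as-is
            sliced[name] = tensor.clone()
    return sliced

# Usage sketch: the 12B variant is obtained with no retraining.
# state_12b = slice_checkpoint(torch.load("elastic_ckpt.pt"), "12B")
```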
The method uses importance estimation to score model components (embedding channels, attention heads, Mamba SSM heads, MoE experts, and FFN channels) by their contribution to accuracy, then sorts them by rank so that smaller-budget submodels use the highest-ranked contiguous subset of components from the larger model, a property called nested weight-sharing. Star Elastic employs an end-to-end trainable router that takes a target budget as a one-hot input and outputs differentiable masks selecting the active components; the router is trained jointly with the model via Gumbel-Softmax, which lets gradients flow through the discrete architectural decisions. The loss combines knowledge distillation (with the non-elastified parent as teacher) and a router loss penalizing deviation from the target resource budget.
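A minimal PyTorch-style sketch of how such a budget-conditioned router and loss could be wired up (module and loss names here are illustrative assumptions, not the released training code): the one-hot budget selects per-component mask logits, a straight-through Gumbel-Softmax keeps the keep/drop decisions differentiable, and a budget-deviation penalty is added to the distillation term.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class BudgetRouter(nn.Module):
    """Maps a one-hot budget vector to differentiable keep/drop masks
    over n_components (e.g. FFN channels, experts, or heads)."""
    def __init__(self, n_budgets: int, n_components: int):
        super().__init__()
        # One (keep, drop) logit pair per component, per budget.
        self.logits = nn.Parameter(torch.zeros(n_budgets, n_components, 2))

    def forward(self, budget_onehot: torch.Tensor, tau: float = 1.0) -> torch.Tensor:
        # Select the logits of the requested budget.
        logits = torch.einsum("b,bnc->nc", budget_onehot, self.logits)
        # Straight-through Gumbel-Softmax: hard 0/1 decisions in the forward
        # pass, soft gradients in the backward pass.
        mask = F.gumbel_softmax(logits, tau=tau, hard=True)[..., 0]
        return mask  # shape [n_components], values in {0, 1}

def elastic_loss(student_logits, teacher_logits, mask, target_ratio, lam=1.0):
    # Knowledge distillation against the non-elastified parent (teacher).
    kd = F.kl_div(
        F.log_softmax(student_logits, dim=-1),
        F.softmax(teacher_logits, dim=-1),
        reduction="batchmean",
    )
    # Router loss: penalize deviation of the kept fraction from the budget.
    router = (mask.mean() - target_ratio) ** 2
    return kd + lam * router
```

The `hard=True` straight-through estimator is what makes this trainable end to end: the forward pass sees binary masks (a concrete submodel), while gradients flow through the soft relaxation to both the router logits and the shared weights.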
Star Elastic enables elastic budget control by using different nested submodels for different reasoning phases: the optimal configuration (ℳS → ℳL) uses a cheaper model for extended reasoning traces and reserves the full-capacity model for synthesizing the final answer. The 23B → 30B configuration advances the accuracy–latency Pareto frontier, achieving up to 16% higher accuracy and 1.9× lower latency compared to default Nemotron Nano v3 budget control.

Quantization-Aware Distillation (QAD) applied directly to the elastic checkpoint preserves the nested mask hierarchy, allowing zero-shot slicing of quantized variants; for NVFP4, a short QAD phase brings 30B variant recovery to 97.79% of BF16 accuracy. Storage efficiency is significant: storing separate 12B, 23B, and 30B BF16 checkpoints requires 126.1 GB, while the single elastic checkpoint requires 58.9 GB, and the 30B NVFP4 elastic checkpoint fits in 18.7 GB, enabling the 12B NVFP4 variant to run on an RTX 5080 where every BF16 configuration runs out of memory.
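The ℳS → ℳL schedule described above can be pictured as a two-phase decoding loop over the same weights: the sliced submodel writes the long reasoning trace, then the full-budget model is switched in (only the masks change, nothing is reloaded) to produce the final answer. A minimal sketch under that assumption, using hypothetical `set_budget` and `generate` helpers:

```python
def budgeted_reasoning(model, prompt_ids, max_think=4096, max_answer=512):
    """Two-phase decoding with nested submodels from one elastic checkpoint.

    `model.set_budget(...)` and `model.generate(...)` are hypothetical helpers:
    switching the budget only changes which components are mask-active,
    so no weights are reloaded between the two phases.
    """
    # Phase 1: the cheaper nested submodel (e.g. the 23B slice) produces
    # the long reasoning trace.
    model.set_budget("23B")
    trace_ids = model.generate(prompt_ids, max_new_tokens=max_think)

    # Phase 2: the full-capacity 30B model synthesizes the final answer,
    # conditioned on the prompt plus the cheaper model's reasoning trace.
    model.set_budget("30B")
    answer_ids = model.generate(trace_ids, max_new_tokens=max_answer)
    return answer_ids
```

The storage numbers follow from the same nesting: because the 23B and 12B variants reuse subsets of the 30B weights, the elastic checkpoint costs roughly the same as the 30B checkpoint alone (58.9 GB) rather than the 126.1 GB of three separate BF16 copies.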
Research paper: Star Elastic: One Checkpoint that Contains Multiple Reasoning Models (PDF)