Beyond Monte Carlo Tree Search: Implicit Chess Strategies with Discrete Diffusion

Challenges of Large Language Models in Complex Problem-Solving

Large language models (LLMs) generate text in a step-by-step manner, which limits their ability to handle tasks that require multiple reasoning steps, such as structured writing and problem-solving. This limitation affects their coherence and decision-making in complex scenarios. While some approaches evaluate various alternatives to improve prediction accuracy, they incur higher computational costs and can lead to errors if future forecasts are incorrect.

Limitations of Current Search Algorithms

Common search algorithms like Monte Carlo Tree Search (MCTS) and beam search are popular in AI planning and decision-making but come with significant limitations. These algorithms rely on repeated simulations of future scenarios, which increases computational costs and makes them unsuitable for real-time applications. Additionally, they depend on a value model to estimate each state; if this model is incorrect, it propagates errors throughout the search process. This accumulation of errors can severely impact decision-making accuracy, particularly in complex tasks requiring long-term planning.

Introducing DIFFUSEARCH: A New Framework for Decision-Making

To address these challenges, researchers from The University of Hong Kong, Shanghai Jiaotong University, Huawei Noah’s Ark Lab, and Shanghai AI Laboratory proposed DIFFUSEARCH. This innovative framework eliminates the need for explicit search algorithms like MCTS. Instead, DIFFUSEARCH trains a policy to directly predict and utilize future representations, refining these predictions iteratively through diffusion models. By integrating the world model and policy into a single framework, DIFFUSEARCH reduces computational overhead while enhancing efficiency and accuracy in long-term planning.

Training Methodology

The DIFFUSEARCH framework employs supervised learning, using Stockfish as an oracle to label board states from chess games. It explores different future representations, ultimately selecting the action-state (s-asa) method for its simplicity and efficiency. Rather than predicting future sequences directly, the model employs discrete diffusion modeling, utilizing self-attention and iterative denoising to gradually enhance action predictions. This approach avoids the costly marginalization of future states during inference by sampling directly from the trained model. An easy-first decoding strategy prioritizes more predictable tokens for denoising, thus improving accuracy.

Performance Evaluation

Researchers evaluated DIFFUSEARCH against three transformer-based baselines: State-Action (S-A), State-Value (S-V), and Action-Value (SA-V) models. Using a dataset of 100,000 chess games, they implemented GPT-2-based models with specific configurations and conducted evaluations on action accuracy, puzzle accuracy, and Elo ratings from a 6000-game internal tournament. DIFFUSEARCH outperformed S-A by 653 Elo points and showed a 19% improvement in action accuracy while using significantly fewer data records than SA-V. The discrete diffusion with linear λt achieved the highest accuracy of 41.31%, surpassing autoregressive and Gaussian methods.

Conclusion and Future Applications

The proposed model demonstrates that implicit search through discrete diffusion can effectively replace explicit search methods and enhance decision-making in chess. Despite using an external oracle and a limited dataset, it shows promise for improvement through self-play and long-context modeling. This method can also be applied to enhance next-token prediction in language models, serving as a foundation for further exploration in AI planning and decision-making.

Explore AI Solutions for Your Business

Discover how artificial intelligence can transform your work processes:

Identify areas for automation and enhance customer interactions with AI.
Establish key performance indicators (KPIs) to ensure your AI investments yield positive business outcomes.
Select tools that align with your needs and are customizable to meet your objectives.
Start with a small AI project, gather data on its effectiveness, and gradually expand its usage.

If you need assistance in managing AI in your business, please contact us at hello@itinai.ru.

Connect with us on Telegram, Twitter, and LinkedIn.

Unleash Your Creative Potential with AI Agents

Competitors are already using AI Agents

Business Problems We Solve

Automation of internal processes.
Optimizing AI costs without huge budgets.
Training staff, developing custom courses for business needs
Integrating AI into client work, automating first lines of contact

Large and Medium Businesses

Startups

Offline Business

Get a plan to reduce routine and improve metrics

100% of clients report increased productivity and reduced operati

AI Agents

Localization Project Manager – Coordinating translation workflows, answering vendor or process-related questions.

Job Title: Localization Project Manager Overview The Localization Project Manager plays a vital role in coordinating translation workflows while addressing vendor and process-related queries. This position is crucial for ensuring that translation projects are executed efficiently…
AI Agents

Environmental Health & Safety Officer – Answering compliance-related questions, retrieving safety protocols or audit histories.

Professional Summary The AI-driven Environmental Health & Safety Officer is a reliable and effective digital team member that performs repetitive and time-consuming tasks with remarkable speed, accuracy, and stability. By automating these tasks, it frees up…
AI Agents

Legal Contract Reviewer – Auto-flagging clause inconsistencies or retrieving precedent cases for review.

Job Title: Legal Contract Reviewer – Auto-flagging Clause Inconsistencies or Retrieving Precedent Cases for Review The AI functions as a reliable and effective digital team member that excels in performing repetitive and time-consuming tasks. With remarkable…
AI Agents

Customer Retention Analyst – Creating customer summaries, identifying churn risk patterns, and suggesting retention steps.

Customer Retention Analyst Professional Summary A highly analytical and detail-oriented Customer Retention Analyst with a proven track record in creating comprehensive customer summaries, identifying churn risk patterns, and suggesting effective retention strategies. Adept at leveraging data-driven…

Itinai.com httpss.mj.runmrqch2uvtvo russian handsome charisma 9fdbb2d5 a55b 425d 8f3b 76d26f86710f 2

AI Business Accelerator

Start Your AI Business in Just a Week with itinai.com

You’re a great fit if you:

Have an audience (even 500+ followers in Instagram, email, etc.)
Have an idea, service, or product you want to scale
Can invest 2–3 hours a day
You’re motivated to earn with AI but don’t want to handle technical setup

AI news and solutions

Unmasking the Web’s Tower of Babel: How Machine Translation Floods Low-Resource Languages with Low-Quality Content

This research paper investigates the prevalence and impact of low-cost machine translation (MT) on the web and large multi-lingual language models (LLMs). It highlights the abundance of MT on the web, the use of multi-way parallelism,…

AI Tech News
Ten Effective Strategies to Lower Large Language Model (LLM) Inference Costs

Practical Solutions to Reduce Large Language Model (LLM) Inference Costs Quantization Decrease precision of model weights and activations to save memory and computational resources. Pruning Remove insignificant weights to reduce neural network size without performance loss.…

AI Tech News
How to Use ChatGPT: A Step-by-Step Guide

AI, particularly ChatGPT by OpenAI, is revolutionizing human-machine interaction. To access ChatGPT, create an account, understand the interface, craft clear prompts, interact with responses, refine queries, explore advanced features, remain aware of limitations, and consider ethical…

AI Tech News
Nexa AI Releases OmniVision-968M: World’s Smallest Vision Language Model with 9x Tokens Reduction for Edge Devices

Edge AI Efficiency and Effectiveness Edge AI aims to be both efficient and effective, but deploying Vision Language Models (VLMs) on edge devices can be challenging. These models are often too large and require too much…

AI Tech News
Hollywood’s strikes near a resolution, but what lies ahead for creatives?

The Writer’s Guild of America (WGA) has reached a draft agreement with the Alliance of Motion Picture and Television Producers (AMPTP), marking the first official industry protections against AI. The agreement includes financial benefits for writers,…

AI Tech News
Wide-eyed Putin confronted with an AI deep fake of himself in live Q&A

Russian President Putin faced an AI-generated deep fake version of himself during a public Q&A. The incident sparked amusement as the AI posed a question on twins and the dangers of AI. Deep fake technology targets…

AI Tech News
Kinetix: An Open-Ended Universe of Physics-based Tasks for Reinforcement Learning

Understanding Kinetix: A New Approach to Reinforcement Learning Self-Supervised Learning Breakthroughs Self-supervised learning has enabled large models to excel in text and image tasks. However, applying similar techniques to agents in decision-making scenarios remains challenging. Traditional…

AI Tech News
From Deep Knowledge Tracing to DKT2: A Leap Forward in Educational AI

Understanding Knowledge Tracing (KT) in Education Knowledge Tracing (KT) is essential in Intelligent Tutoring Systems (ITS). It helps track what students know and predict how they will perform in the future. Traditional models like Bayesian Knowledge…

AI Tech News
Build a Multi-Tool AI Agent with Nebius and Llama 3 for Developers and Researchers

Building a Powerful Multi-Tool AI Agent with Nebius This tutorial explores the creation of an advanced AI agent using Nebius, specifically leveraging components like ChatNebius, NebiusEmbeddings, and NebiusRetriever. By utilizing the Llama-3.3-70B-Instruct-fast model, this agent aims…

AI Tech News
LUMOS: An Open-Source Generalizable Language Agent Training Framework

AI Tech News
Cartesia AI Released Rene: A Groundbreaking 1.3B Parameter Open-Source Small Language Model Transforming Natural Language Processing Applications

Practical Solutions and Value of Cartesia AI’s Rene Language Model Architecture and Training Cartesia AI’s Rene language model is built on a hybrid architecture, combining feedforward and sliding window attention layers to effectively manage long-range dependencies…

AI Tech News
Plant-based materials give ‘life’ to tiny soft robots

Researchers have developed advanced materials for soft medical microrobots, paving the way for minimally invasive medical procedures like biopsies and cell and tissue transport. These robots hold promise for the future of healthcare.

AI Tech News
UK government releases schedule for the AI Safety Summit

The UK’s AI Safety Summit, taking place on November 1-2, 2023, has published the program for day one. The event aims to influence the development of safe AI and will include representatives from international governments, major…

AI Tech News
Top 20 Code Review Tools for Software Developers

Practical Solutions and Value of Top 20 Code Review Tools for Software Developers Introduction In the fast-paced world of software development, maintaining high code quality is crucial for success. Code reviews play a vital role in…

AI Tech News
Future-Proofing Our Interns: Cultivating the Next Generation Amidst AI’s Corporate March

The text discusses the intersection of AI and sustainability, emphasizing the need to demystify technology and understand its true capabilities. It highlights the role of AI as a powerful ally to human capability but also warns…

AI Tech News
Amazon Translate vs Google Translate: Which Cloud Giant Handles Scale and Speed Better?

Amazon Translate vs. Google Translate: A Business Comparison This comparison aims to evaluate Amazon Translate and Google Translate as potential solutions for businesses needing machine translation services. Both are powerful tools, but cater to slightly different…

Compare
This AI Paper Introduces Perseus: A Trailblazing Framework for Slashing Energy Bloat in Large-Scale Machine Learning and AI Model Training by Up to 30%

Large language models like GPT-3 require substantial energy for training and operational needs, with varying consumption based on factors such as size and task complexity. Researchers at the University of Michigan and the University of Washington…

AI Tech News
Microsoft AI Researchers Release LLaVA-Rad: A Lightweight Open-Source Foundation Model for Advanced Clinical Radiology Report Generation

Introduction to LLaVA-Rad Large foundation models have shown great promise in the biomedical field, especially in tasks requiring minimal labeled data. However, using these advanced models in clinical settings faces challenges such as performance gaps and…

AI Tech News
IBM Researchers Introduce AI-Hilbert: An Innovative Machine Learning Framework for Scientific Discovery Integrating Algebraic Geometry and Mixed-Integer Optimization

Practical Solutions for Scientific Discovery Integrating Background Knowledge with Experimental Data Recent advances in global optimization methods offer promising tools for scientific discovery by integrating background knowledge with experimental data. Derive Well-Known Laws with Guaranteed Results…

AI Tech News
Mora: A New Multi-Agent Framework that Incorporates Several Advanced Visual AI Agents to Replicate Generalist Video Generation Demonstrated by Sora

AI Tech News