Introduction to Tongyi DeepResearch
Alibaba has made a significant leap in the field of artificial intelligence with the release of Tongyi DeepResearch-30B-A3B, a large language model (LLM) designed specifically for deep research tasks. This model is not just another AI; it’s built to handle complex, long-horizon research workflows that require extensive information gathering and synthesis.
Understanding the Model’s Architecture
Tongyi DeepResearch employs a mixture-of-experts (MoE) architecture, boasting around 30.5 billion parameters, with approximately 3 to 3.3 billion active parameters per token. This design allows the model to maintain high throughput while delivering strong reasoning capabilities. The model is optimized for multi-turn research workflows, which include searching, browsing, extracting, cross-checking, and synthesizing evidence.
Performance Benchmarks
The performance of Tongyi DeepResearch is impressive, as evidenced by its state-of-the-art results on various agentic search suites:
- Humanity’s Last Exam (HLE): 32.9
- BrowseComp: 43.4 (English) and 46.7 (Chinese)
- xbench-DeepSearch: 75
These benchmarks indicate that Tongyi DeepResearch is competitive with leading models like those from OpenAI, outperforming many existing proprietary and open-source agents.
Training and Inference Capabilities
The training pipeline for Tongyi DeepResearch is noteworthy. It utilizes a fully automated data engine that incorporates:
- Agentic Continual Pre-Training (CPT): This involves large-scale synthetic trajectories derived from curated corpora and historical tool traces.
- On-Policy Reinforcement Learning (RL): The model employs Group Relative Policy Optimization (GRPO) to stabilize learning in dynamic web environments.
These training methods ensure that the model is not just reactive but can plan and execute complex research tasks effectively.
Key Features of Tongyi DeepResearch
Some standout features of this model include:
- MoE Efficiency: The model’s architecture allows for the inference cost of a smaller model while retaining the capabilities of a larger one.
- 128K Context Window: This feature supports long-horizon rollouts, making it ideal for extensive web research.
- Dual Inference Modes: The model can operate in both native ReAct mode for direct reasoning and in IterResearch mode for deeper synthesis.
Applications in Research Workflows
Tongyi DeepResearch is particularly suited for tasks that require:
- Long-horizon planning
- Iterative retrieval and verification across multiple sources
- Evidence tracking with minimal hallucination rates
- Synthesis of information under large contexts
The model’s ability to restructure context during each round of inquiry helps mitigate errors and enhances the reliability of the information gathered.
Conclusion
Tongyi DeepResearch-30B-A3B represents a significant advancement in the development of AI for deep research tasks. With its innovative architecture, robust training methods, and impressive performance metrics, it offers a practical solution for teams looking to enhance their research capabilities. This model not only balances inference cost and capability but also sets a new standard for precision and reliability in AI-driven research.
Frequently Asked Questions (FAQ)
1. What is Tongyi DeepResearch?
Tongyi DeepResearch is an open-source large language model developed by Alibaba, designed for deep research tasks that require extensive information gathering and synthesis.
2. How does the MoE architecture benefit the model?
The mixture-of-experts architecture allows Tongyi DeepResearch to maintain high performance while keeping inference costs low, making it efficient for large-scale applications.
3. What are the key performance metrics of Tongyi DeepResearch?
The model has achieved state-of-the-art results on various benchmarks, including scores of 32.9 on Humanity’s Last Exam and 75 on xbench-DeepSearch.
4. How is the model trained?
Tongyi DeepResearch is trained using a combination of synthetic data, continual pre-training, and reinforcement learning techniques to ensure robust performance in dynamic environments.
5. What are the practical applications of Tongyi DeepResearch?
The model is ideal for tasks that involve long-horizon planning, iterative retrieval, and synthesis of information from multiple sources, making it valuable in academic and professional research settings.


























