Convergence Releases Proxy Lite: A Mini, Open-Weights Version of Proxy Assistant Performing Pretty Well on UI Navigation Tasks

Challenges in Web Interaction Automation

Automating interactions with web content is a complex task in today’s digital environment. Many solutions are resource-heavy and designed for specific tasks, limiting their effectiveness across various applications. Developers struggle to find a balance between computational efficiency and the model’s ability to generalize across different websites, as traditional systems often lack the reflective reasoning necessary for unpredictable web scenarios. Proprietary models can further complicate matters by restricting access to their inner workings, hindering innovation in the open-source community. This highlights the need for an efficient and accessible automation tool.

Introducing Proxy Lite

Convergence has addressed this need with Proxy Lite, a streamlined, open-weights version of their acclaimed Proxy assistant. This 3B parameter Vision-Language Model provides robust web automation capabilities to the open-source community. Proxy Lite focuses on efficiency and reliability, allowing it to handle various web tasks without extensive computational requirements.

Key Features of Proxy Lite

Proxy Lite stands out due to its transparent design and open-weights model, which encourages community exploration and enhancements. It integrates a Vision-Language Model with browser interactions, allowing detailed control over browser tasks, from data extraction to complex navigation, while efficiently managing resources.

Technical Advantages

Proxy Lite is built on the Qwen2.5-VL-3B-Instruct foundation, balancing performance and efficiency through a structured three-phase response process:

  • Observation: The model assesses the current web page state, confirming actions like dismissing overlays.
  • Thinking: It evaluates potential next actions based on context.
  • Tool Call: It issues precise commands to execute the chosen action.

This approach enhances reliability and allows for generalization across various web interactions. Its design supports easy integration into command-line interfaces and Streamlit applications, making it accessible even for users with limited technical skills.

Performance Evaluation

Proxy Lite has undergone thorough evaluation using the WebVoyager benchmark, achieving a commendable overall score of 72.4%. Specific performance statistics demonstrate its capabilities:

  • Allrecipes: 87.8% success rate with an average of 10.3 message exchanges, effective in content-rich environments.
  • Amazon: 70.0% success rate, showcasing its ability to navigate dynamic e-commerce platforms.
  • High-Profile Sites: Low 80s success rates on platforms like Apple and GitHub indicate consistent reliability.
  • Google Services: Despite lower success in areas like Google Flights, the overall performance remains competitive.

These results suggest that Proxy Lite efficiently manages tasks while avoiding the burdens typical of larger proprietary models, highlighting its current utility and potential for community-driven improvements.

Conclusion

Proxy Lite represents a well-designed solution for web automation challenges. By tackling issues like resource constraints, generalization, and transparency, it offers a practical tool for automating online tasks. Its open-weights design invites collaboration and ongoing innovation, serving as a valuable resource for both academic and commercial applications.

Explore the Technical Details and Model. All credit for this research goes to the project’s researchers. Follow us on Twitter and join our 80k+ ML SubReddit.

Leverage AI for Business Transformation

Discover how AI can streamline your operations:

  • Identify automation opportunities in customer interactions to maximize AI value.
  • Track key performance indicators (KPIs) to ensure your AI initiatives are effective.
  • Select customizable tools that fit your business objectives.
  • Start small, evaluate effectiveness, and expand your AI applications gradually.

For guidance on managing AI in business, contact us at hello@itinai.ru or reach us on Telegram, X, and LinkedIn.


AI Products for Business or Try Custom Development

AI Sales Bot

Welcome AI Sales Bot, your 24/7 teammate! Engaging customers in natural language across all channels and learning from your materials, it’s a step towards efficient, enriched customer interactions and sales

AI Document Assistant

Unlock insights and drive decisions with our AI Insights Suite. Indexing your documents and data, it provides smart, AI-driven decision support, enhancing your productivity and decision-making.

AI Customer Support

Upgrade your support with our AI Assistant, reducing response times and personalizing interactions by analyzing documents and past engagements. Boost your team and customer satisfaction

AI Scrum Bot

Enhance agile management with our AI Scrum Bot, it helps to organize retrospectives. It answers queries and boosts collaboration and efficiency in your scrum processes.