Building AI Agents: 5% AI and 100% Software Engineering
Building AI agents is more a software engineering problem than a modeling one. Data management, controls, and observability determine whether an agent succeeds in production. This article walks through the essential components of a doc-to-chat pipeline and how to integrate AI agents into an existing software stack.
Understanding the Doc-to-Chat Pipeline
A doc-to-chat pipeline processes enterprise documents by ingesting, standardizing, enforcing governance, indexing embeddings, and serving retrieval and generation through authenticated APIs. This architecture is vital for applications like agentic Q&A, copilots, and workflow automation, ensuring that responses comply with permissions and are audit-ready.
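The stages above can be sketched end to end in a few lines of Python. Everything here is illustrative rather than any specific library's API: the `Document` record, a keyword posting list standing in for embeddings, and an ACL check standing in for real governance. The point is the shape: governance is enforced at serving time, before anything is returned.

```python
from dataclasses import dataclass

@dataclass
class Document:
    doc_id: str
    text: str
    readers: frozenset  # principals allowed to see this document

def ingest(raw: str, doc_id: str, readers) -> Document:
    # Standardize: collapse whitespace into a canonical form.
    return Document(doc_id=doc_id, text=" ".join(raw.split()),
                    readers=frozenset(readers))

def index(docs):
    # Stand-in for embedding + vector indexing: keyword posting lists.
    idx = {}
    for doc in docs:
        for token in set(doc.text.lower().split()):
            idx.setdefault(token, []).append(doc)
    return idx

def retrieve(idx, query: str, principal: str):
    # Governance at serving time: filter by ACL before returning anything.
    hits = idx.get(query.lower(), [])
    return [d.doc_id for d in hits if principal in d.readers]

docs = [
    ingest("Quarterly revenue grew 12%", "fin-1", {"alice"}),
    ingest("Onboarding guide for new hires", "hr-1", {"alice", "bob"}),
]
idx = index(docs)
print(retrieve(idx, "revenue", "bob"))    # [] -- bob cannot read fin-1
print(retrieve(idx, "revenue", "alice"))  # ['fin-1']
```

A real pipeline would swap the posting list for an embedding index and the `readers` set for row-level policies, but the permission filter stays in the serving path either way.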
Integration with Existing Stacks
To integrate AI agents cleanly, expose them behind standard service boundaries such as REST/JSON or gRPC over a trusted storage layer. For tables, Iceberg offers ACID transactions, schema evolution, and snapshots, which make retrieval reproducible. For vector data, pgvector keeps embeddings alongside relational data in Postgres, while dedicated engines like Milvus handle high-queries-per-second (QPS) approximate nearest neighbor (ANN) search.
Key Properties of Data Management
- Iceberg Tables: Provide ACID compliance, hidden partitioning, and snapshot isolation.
- pgvector: Combines SQL and vector similarity in a single query plan.
- Milvus: Features a scalable architecture for large-scale similarity searches.
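To make the pgvector bullet concrete: the SQL comment below shows the typical `ORDER BY embedding <-> query` form, and the plain-Python function is a brute-force stand-in for what that operator ranks by. The table name, ids, and vectors are hypothetical.

```python
import math

# Typical pgvector query (for reference; `<->` is L2 distance):
#   SELECT id FROM items
#   ORDER BY embedding <-> '[0.9, 0.1]'
#   LIMIT 2;

def l2(a, b):
    # Euclidean distance between two equal-length vectors.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def top_k(rows, query, k):
    # rows: list of (id, embedding); rank by L2 distance, ascending.
    return [rid for rid, emb in sorted(rows, key=lambda r: l2(r[1], query))[:k]]

rows = [("a", [1.0, 0.0]), ("b", [0.0, 1.0]), ("c", [0.85, 0.15])]
print(top_k(rows, [0.9, 0.1], 2))  # ['c', 'a']
```

pgvector executes the same ranking inside the Postgres query planner, which is what lets you combine a vector `ORDER BY` with ordinary SQL filters and joins in one statement.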
Coordinating Agents, Humans, and Workflows
Effective production agents require clear coordination points for human intervention. Tools like AWS A2I offer managed human-in-the-loop (HITL) processes, ensuring that low-confidence outputs are reviewed. Frameworks such as LangGraph can model these checkpoints within agent workflows, making approvals an explicit step in the graph rather than an afterthought.
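The routing pattern itself is plain control flow. This sketch shows a confidence-gated review queue; the threshold, record shapes, and function names are illustrative, not the A2I or LangGraph APIs.

```python
REVIEW_THRESHOLD = 0.8  # illustrative cutoff; tune per workload

def route(answer: str, confidence: float, review_queue: list) -> dict:
    """Auto-approve confident outputs; escalate the rest to humans."""
    if confidence >= REVIEW_THRESHOLD:
        return {"status": "auto_approved", "answer": answer}
    # Below threshold: park the draft for human review instead of serving it.
    review_queue.append({"answer": answer, "confidence": confidence})
    return {"status": "pending_review", "answer": None}

queue = []
print(route("Refund policy is 30 days.", 0.93, queue)["status"])    # auto_approved
print(route("Contract clause 7.2 applies.", 0.41, queue)["status"])  # pending_review
print(len(queue))  # 1
```

In a framework like LangGraph this gate would be a node whose low-confidence branch pauses the graph until a reviewer resumes it; in A2I, the queue is a managed review workflow.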
Ensuring Reliability Before Model Deployment
Reliability in AI systems should be approached as a layered defense strategy:
- Language and Content Guardrails: Pre-validate inputs and outputs for safety.
- PII Detection and Redaction: Utilize tools like Microsoft Presidio to identify and mask personally identifiable information.
- Access Control and Lineage: Implement row- and column-level access controls to maintain compliance.
- Retrieval Quality Gates: Assess retrieval-augmented generation (RAG) using metrics like faithfulness and context precision.
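As a minimal stand-in for a detector like Presidio, the sketch below shows the detect-and-mask shape with two regex recognizers. Real systems combine many entity types, NER models, and context scoring; the patterns and placeholder format here are illustrative only.

```python
import re

# Two toy recognizers; a real detector supports dozens of entity types.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "PHONE": re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"),
}

def redact(text: str) -> str:
    # Replace each detected span with a typed placeholder.
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"<{label}>", text)
    return text

print(redact("Reach Jane at jane.doe@example.com or 555-123-4567."))
# Reach Jane at <EMAIL> or <PHONE>.
```

Typed placeholders (rather than plain deletion) keep redacted text usable downstream, since the model still sees that an email or phone number was present.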
Scaling Indexing and Retrieval
To handle real traffic, focus on two dimensions: ingest throughput and query concurrency. Normalize data at the lakehouse edge and write to Iceberg so every index build runs against a versioned snapshot. For vector serving, leverage a dedicated engine such as Milvus, whose architecture supports horizontal scaling and independent failure domains.
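Ingest throughput is largely a batching-and-parallelism problem. This sketch batches documents and fans the batches out across worker threads; the embedding function is a placeholder that just records text length, standing in for a real model call whose per-request overhead batching amortizes.

```python
from concurrent.futures import ThreadPoolExecutor

def embed_batch(batch):
    # Placeholder for a real embedding call; returns (doc_id, vector) pairs.
    return [(doc_id, [float(len(text))]) for doc_id, text in batch]

def ingest_all(docs, batch_size=2, workers=4):
    # Split into fixed-size batches, then embed batches concurrently.
    batches = [docs[i:i + batch_size] for i in range(0, len(docs), batch_size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = pool.map(embed_batch, batches)  # preserves batch order
    return [row for batch in results for row in batch]

docs = [("d1", "alpha"), ("d2", "beta"), ("d3", "gamma"), ("d4", "delta")]
vectors = ingest_all(docs)
print(len(vectors))  # 4
```

Query concurrency is the mirror problem: keep the serving path stateless so replicas can be added behind a load balancer, which is the property Milvus's separated components are designed to give you.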
Monitoring Beyond Logs
Effective monitoring requires a mix of traces, metrics, and evaluations:
- Distributed Tracing: Use OpenTelemetry for comprehensive visibility.
- LLM Observability Platforms: Compare options like LangSmith and Arize Phoenix.
- Continuous Evaluation: Regularly evaluate canary sets to track performance over time.
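A continuous-evaluation job can be as simple as re-scoring a fixed canary set on every deploy and alerting when the average drifts. This sketch computes a context-precision-style score over two hypothetical canaries; the chunk ids and relevance labels are made up for illustration.

```python
def context_precision(retrieved, relevant):
    """Fraction of retrieved chunks that are actually relevant."""
    if not retrieved:
        return 0.0
    return sum(1 for c in retrieved if c in relevant) / len(retrieved)

# Hypothetical canary set: fixed queries with known-relevant chunk ids.
canaries = [
    {"retrieved": ["c1", "c2", "c7"], "relevant": {"c1", "c2"}},
    {"retrieved": ["c3", "c9"], "relevant": {"c3"}},
]

scores = [context_precision(c["retrieved"], c["relevant"]) for c in canaries]
avg = sum(scores) / len(scores)
print(round(avg, 3))  # 0.583
```

Because the canary set is fixed, a drop in the average isolates retrieval decay (stale index, changed chunking, permission drift) from changes in live traffic.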
Conclusion: The Importance of Software Engineering in AI
The notion that building AI agents is 5% AI and 100% software engineering underscores the reality that most failures in agent systems arise from issues related to data quality, permissioning, and retrieval decay rather than model performance. By prioritizing strong data management and observability practices, organizations can ensure their AI systems are both reliable and effective.
FAQs
- What is a doc-to-chat pipeline? A doc-to-chat pipeline processes documents for applications like Q&A and workflow automation, ensuring compliance and audit readiness.
- Why is software engineering more important than AI models in building agents? Most failures stem from data management issues rather than the AI models themselves, making software engineering critical.
- How can I ensure data quality in AI systems? Implementing strict data management practices and monitoring can help maintain data quality.
- What tools can assist in human-in-the-loop processes? Tools like AWS A2I can help manage HITL processes effectively.
- How do I scale indexing and retrieval for AI systems? Focus on optimizing ingest throughput and query concurrency, and consider using architectures like Milvus for vector serving.