Building AI Agents: Why Software Engineering Matters More Than AI

Building AI Agents: 5% AI and 100% Software Engineering

The development of AI agents is more about software engineering than the AI models themselves. Key elements such as data management, controls, and observability play a crucial role in ensuring success. This article delves into the essential components of a doc-to-chat pipeline and how to effectively integrate AI agents into existing software stacks.

Understanding the Doc-to-Chat Pipeline

A doc-to-chat pipeline processes enterprise documents by ingesting, standardizing, enforcing governance, indexing embeddings, and serving retrieval and generation through authenticated APIs. This architecture is vital for applications like agentic Q&A, copilots, and workflow automation, ensuring that responses comply with permissions and are audit-ready.

Integration with Existing Stacks

To seamlessly integrate AI agents, it’s important to utilize standard service boundaries such as REST/JSON or gRPC over a trusted storage layer. For managing tables, Iceberg offers ACID compliance, schema evolution, and snapshots, which are essential for reproducible retrieval. When dealing with vector data, pgvector is a great option for embedding management, and dedicated engines like Milvus can handle high-query-per-second (QPS) approximate nearest neighbor (ANN) searches.

Key Properties of Data Management

Iceberg Tables: Provide ACID compliance, hidden partitioning, and snapshot isolation.
pgvector: Combines SQL and vector similarity in a single query plan.
Milvus: Features a scalable architecture for large-scale similarity searches.

Coordinating Agents, Humans, and Workflows

Effective production agents require clear coordination points for human intervention. Tools like AWS A2I offer managed human-in-the-loop (HITL) processes, ensuring that low-confidence outputs are reviewed. Frameworks such as LangGraph can model these checkpoints within agent workflows, making approvals a key part of the process.

Ensuring Reliability Before Model Deployment

Reliability in AI systems should be approached as a layered defense strategy:

Language and Content Guardrails: Pre-validate inputs and outputs for safety.
PII Detection and Redaction: Utilize tools like Microsoft Presidio to identify and mask personally identifiable information.
Access Control and Lineage: Implement row- and column-level access controls to maintain compliance.
Retrieval Quality Gates: Assess retrieval-augmented generation (RAG) using metrics like faithfulness and context precision.

Scaling Indexing and Retrieval

To effectively manage real traffic, focus on two main aspects: ingest throughput and query concurrency. Normalize data at the lakehouse edge and write to Iceberg for versioned snapshots. For vector serving, leverage Milvus’s architecture to support horizontal scaling and independent failure domains.

Monitoring Beyond Logs

Effective monitoring requires a mix of traces, metrics, and evaluations:

Distributed Tracing: Use OpenTelemetry for comprehensive visibility.
LLM Observability Platforms: Compare options like LangSmith and Arize Phoenix.
Continuous Evaluation: Regularly evaluate canary sets to track performance over time.

Conclusion: The Importance of Software Engineering in AI

The notion that building AI agents is 5% AI and 100% software engineering underscores the reality that most failures in agent systems arise from issues related to data quality, permissioning, and retrieval decay rather than model performance. By prioritizing strong data management and observability practices, organizations can ensure their AI systems are both reliable and effective.

FAQs

What is a doc-to-chat pipeline? A doc-to-chat pipeline processes documents for applications like Q&A and workflow automation, ensuring compliance and audit readiness.
Why is software engineering more important than AI models in building agents? Most failures stem from data management issues rather than the AI models themselves, making software engineering critical.
How can I ensure data quality in AI systems? Implementing strict data management practices and monitoring can help maintain data quality.
What tools can assist in human-in-the-loop processes? Tools like AWS A2I can help manage HITL processes effectively.
How do I scale indexing and retrieval for AI systems? Focus on optimizing ingest throughput and query concurrency, and consider using architectures like Milvus for vector serving.

Unleash Your Creative Potential with AI Agents

Competitors are already using AI Agents

Business Problems We Solve

Automation of internal processes.
Optimizing AI costs without huge budgets.
Training staff, developing custom courses for business needs
Integrating AI into client work, automating first lines of contact

Large and Medium Businesses

Startups

Offline Business

Get a plan to reduce routine and improve metrics

100% of clients report increased productivity and reduced operati

AI Agents

Localization Project Manager – Coordinating translation workflows, answering vendor or process-related questions.

Job Title: Localization Project Manager Overview The Localization Project Manager plays a vital role in coordinating translation workflows while addressing vendor and process-related queries. This position is crucial for ensuring that translation projects are executed efficiently…
AI Agents

Environmental Health & Safety Officer – Answering compliance-related questions, retrieving safety protocols or audit histories.

Professional Summary The AI-driven Environmental Health & Safety Officer is a reliable and effective digital team member that performs repetitive and time-consuming tasks with remarkable speed, accuracy, and stability. By automating these tasks, it frees up…
AI Agents

Legal Contract Reviewer – Auto-flagging clause inconsistencies or retrieving precedent cases for review.

Job Title: Legal Contract Reviewer – Auto-flagging Clause Inconsistencies or Retrieving Precedent Cases for Review The AI functions as a reliable and effective digital team member that excels in performing repetitive and time-consuming tasks. With remarkable…
AI Agents

Customer Retention Analyst – Creating customer summaries, identifying churn risk patterns, and suggesting retention steps.

Customer Retention Analyst Professional Summary A highly analytical and detail-oriented Customer Retention Analyst with a proven track record in creating comprehensive customer summaries, identifying churn risk patterns, and suggesting effective retention strategies. Adept at leveraging data-driven…

Itinai.com httpss.mj.runmrqch2uvtvo russian handsome charisma 9fdbb2d5 a55b 425d 8f3b 76d26f86710f 2

AI Business Accelerator

Start Your AI Business in Just a Week with itinai.com

You’re a great fit if you:

Have an audience (even 500+ followers in Instagram, email, etc.)
Have an idea, service, or product you want to scale
Can invest 2–3 hours a day
You’re motivated to earn with AI but don’t want to handle technical setup

AI news and solutions

My Experience with DevOps and DataOps

In this article, the author discusses their experience working as a data engineer in both a DevOps-focused role and an analytics engineering role. They highlight the differences between DevOps and DataOps, including the focus on software…

AI Tech News
Reinforcement Learning Breakthroughs in Open-Weight LLMs for Software Engineering Automation

Introduction to Reinforcement Learning in Software Engineering The field of software engineering automation is undergoing significant transformation, largely due to advancements in Large Language Models (LLMs). Traditional methods often rely on proprietary models or expensive teacher-based…

AI Tech News
Redefining Single-Channel Speech Enhancement: The xLSTM-SENet Approach

Challenges in Speech Processing Speech processing systems often have difficulty providing clear audio in noisy environments. This affects important applications like hearing aids, automatic speech recognition (ASR), and speaker verification. Traditional speech enhancement systems use neural…

AI Tech News
Researchers from NVIDIA Introduce Retro 48B: The Largest LLM Pretrained with Retrieval before Instruction Tuning

Researchers from Nvidia and the University of Illinois at Urbana-Champaign have developed Retro 48B, a larger language model that improves on previous retrieval-augmented models. By pre-training with retrieval on a vast corpus, Retro 48B enhances task…

AI Tech News
This new tool could give artists an edge over AI

Nightshade, a new tool developed by a computer science lab at the University of Chicago, may shift the power dynamics between artists and technology companies. By applying Nightshade to their work, artists can trick machine-learning models…

AI Tech News
Building a Semantic Search Engine with Sentence Transformers and FAISS

Building a Semantic Search Engine Building a Semantic Search Engine: A Practical Guide Understanding Semantic Search Semantic search enhances traditional keyword matching by grasping the contextual meaning of search queries. Unlike conventional systems that rely solely…

AI Tech News
Researchers from the University of Maryland Introduce an Automatic Text Privatization Framework that Fine-Tunes a Large Language Model via Reinforcement Learning

The Importance of Privacy in Online Communities The privacy of users in online communities is crucial, and websites like Reddit allow users to post under fictitious names to protect their identity. It is essential to maintain…

AI Tech News
IoT-LLM: An AI Framework that Integrates IoT Sensor Data with LLMs to Enhance their Perception and Reasoning Abilities in the Physical World

Enhancing IoT with AI: The IoT-LLM Framework Growing sectors like Healthcare, Logistics, and Smart Cities rely on interconnected devices that need advanced reasoning capabilities. To address this, researchers are integrating real-time data and context into Large…

AI Tech News
Improving Vision-inspired Keyword Spotting Using a Streaming Conformer Encoder With Input-dependent Dynamic Depth

This text proposes an architecture capable of processing streaming audio using a vision-inspired keyword spotting framework. By extending a Conformer encoder with trainable binary gates, the approach improves detection and localization accuracy on continuous speech while…

AI Tech News
Hugging Face Introduces Cosmopedia To Create Large-Scale Synthetic Data For Pre-Training

AI Tech News
Chunking vs. Tokenization: Essential Insights for AI Text Processing

When diving into the world of artificial intelligence and natural language processing, two concepts often come to the forefront: tokenization and chunking. These techniques are essential for breaking down text, but they serve distinct purposes and…

AI Tech News
Navigating the Landscape of CLIP: Investigating Data, Architecture, and Training Strategies

AI Tech News
Evaluating Large Language Models

Generative AI has rapidly developed since going mainstream, with new models emerging regularly. Evaluating generative models is more complex than discriminative models due to the challenge of assessing quality, coherence, diversity, and usefulness. Evaluation methods include…

AI Tech News
Researchers from Stanford and Google AI Introduce MELON: An AI Technique that can Determine Object-Centric Camera Poses Entirely from Scratch while Reconstructing the Object in 3D

MELON, a new AI technique developed by Stanford and Google researchers, addresses the challenge of reconstructing 3D objects from 2D images with unknown poses. By utilizing lightweight CNN encoders and introducing a modulo loss that considers…

AI Tech News
CAMEL-AI Unveils CAMEL: Revolutionary Multi-Agent Framework for Enhanced Autonomous Cooperation Among Communicative Agents

CAMEL-AI Unveils CAMEL: Revolutionary Multi-Agent Framework for Enhanced Autonomous Cooperation Among Communicative Agents CAMEL-AI has introduced CAMEL, a communicative agent framework designed to enhance scalability and autonomous cooperation among language model agents. The framework minimizes the…

AI Tech News
Adaptive Attacks on LLMs: Lessons from the Frontlines of AI Robustness Testing

Understanding the Importance of AI Safety The field of Artificial Intelligence (AI) is progressing quickly, especially with Large Language Models (LLMs) becoming essential in AI applications. These models come with built-in safety features to prevent unethical…

AI Tech News
Apple Releases AIMv2: A Family of State-of-the-Art Open-Set Vision Encoders

Vision Models and Their Evolution Vision models have greatly improved over time, responding to the challenges of previous versions. Researchers in computer vision often struggle with making models that are both complex and adaptable. Many current…

AI Tech News
NIST Releases a Machine Learning Tool for Testing AI Model Risks

Practical AI Tools for Ensuring Model Reliability and Security The rapid advancement and widespread adoption of AI systems have brought about numerous benefits but also significant risks. AI systems can be susceptible to attacks, leading to…

AI Tech News
Advancing Artificial Intelligence: Sungkyunkwan University’s Innovative Memory System Called ‘Memoria’ Boosts Transformer Performance on Long-Sequence Complex Tasks

Researchers at Sungkyunkwan University have developed a novel memory system called “Memoria” that enhances the performance of transformer models in handling lengthy data sequences. The system draws inspiration from human memory principles and has shown promising…

AI Tech News
Revolutionizing Heuristic Design: Monte Carlo Tree Search Meets Large Language Models

Understanding Heuristic Design Heuristic design is a vital tool used in fields like artificial intelligence and operations research to solve complex optimization problems. Traditionally, experts create these designs manually, which can be slow and costly. Introducing…

AI Tech News