In today’s fast-paced world of artificial intelligence, performance is key. When working with Large Language Models (LLMs), developers often find themselves waiting on API responses, and when several calls must run one after another, that waiting adds up. This is where asyncio comes in: many developers use LLMs without realizing that asynchronous programming can significantly speed up their applications.
What is Asyncio?
Python’s asyncio library allows developers to write concurrent code using the async/await syntax, meaning multiple I/O-bound tasks can make progress within a single thread. Essentially, while synchronous code processes tasks one after another, like standing in a single line at the grocery store, asynchronous code interleaves tasks, switching to another whenever the current one is waiting, akin to using multiple self-checkout machines. This is especially beneficial for API calls, which mostly consist of waiting for responses.
Getting Started with Asynchronous Python
Example: Running Tasks With and Without Asyncio
Consider a simple function that prints a greeting, waits for 2 seconds, and then completes. In a synchronous setup, running this function three times results in a total wait time of 6 seconds. However, by using asyncio, all three greetings can be printed almost simultaneously, significantly reducing the total wait time.
import time

def say_hello():
    print("Hello...")
    time.sleep(2)  # simulate waiting (like an API call)
    print("...World!")

def main():
    say_hello()
    say_hello()
    say_hello()

if __name__ == "__main__":
    start = time.time()
    main()
    print(f"Finished in {time.time() - start:.2f} seconds")
In contrast, the asynchronous version starts all three calls almost at the same time; their two-second waits overlap, so the total runtime stays close to 2 seconds instead of 6.
import asyncio
import time

import nest_asyncio
nest_asyncio.apply()  # only needed inside notebooks, which already run an event loop

async def say_hello():
    print("Hello...")
    await asyncio.sleep(2)  # simulate waiting (like an API call)
    print("...World!")

async def main():
    await asyncio.gather(
        say_hello(),
        say_hello(),
        say_hello()
    )

if __name__ == "__main__":
    start = time.time()
    asyncio.run(main())
    print(f"Finished in {time.time() - start:.2f} seconds")
Example: Download Simulation
Imagine needing to download several files. In a synchronous approach, each download would block the next one until it completes. However, with asyncio, your program can handle multiple downloads at once, making better use of time.
import asyncio
import random
import time

async def download_file(file_id: int):
    print(f"Start downloading file {file_id}")
    download_time = random.uniform(1, 3)  # simulate variable download time
    await asyncio.sleep(download_time)    # non-blocking wait
    print(f"Finished downloading file {file_id} in {download_time:.2f} seconds")
    return f"File {file_id} content"

async def main():
    files = [1, 2, 3, 4, 5]
    start_time = time.time()
    results = await asyncio.gather(*(download_file(f) for f in files))
    end_time = time.time()
    print("\nAll downloads completed.")
    print(f"Total time taken: {end_time - start_time:.2f} seconds")
    print("Results:", results)

if __name__ == "__main__":
    asyncio.run(main())
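In practice, unbounded concurrency is rarely what you want: real servers cap connections and APIs enforce rate limits. A common remedy is asyncio.Semaphore, which limits how many coroutines can enter a section of code at once. Below is a minimal sketch that reuses download_file from the example above; limited_download is an illustrative name, not a library function:

async def limited_download(semaphore: asyncio.Semaphore, file_id: int):
    # wait for a free slot before starting the download
    async with semaphore:
        return await download_file(file_id)

async def main():
    semaphore = asyncio.Semaphore(2)  # at most 2 downloads in flight at once
    files = [1, 2, 3, 4, 5]
    results = await asyncio.gather(*(limited_download(semaphore, f) for f in files))
    print("Results:", results)

if __name__ == "__main__":
    asyncio.run(main())

With the cap set to 2, the five downloads proceed in overlapping pairs rather than all at once.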
Using Asyncio in an AI Application with an LLM
Now, let’s see how to apply asyncio in a real-world AI context. Applications built on LLMs like OpenAI’s GPT models often make many API calls. If these calls run sequentially, each request sits idle while the previous one waits on the network. Let’s compare the performance of running multiple prompts with and without asyncio.
!pip install openai

import asyncio
import os
import time
from getpass import getpass
from openai import OpenAI

os.environ['OPENAI_API_KEY'] = getpass('Enter OpenAI API Key: ')

client = OpenAI()
def ask_llm(prompt: str):
    response = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": prompt}]
    )
    return response.choices[0].message.content

def main():
    prompts = [
        "Briefly explain quantum computing.",
        "Write a 3-line haiku about AI.",
        "List 3 startup ideas in agri-tech.",
        "Summarize Inception in 2 sentences.",
        "Explain blockchain in 2 sentences.",
        "Write a 3-line story about a robot.",
        "List 5 ways AI helps healthcare.",
        "Explain Higgs boson in simple terms.",
        "Describe neural networks in 2 sentences.",
        "List 5 blog post ideas on renewable energy.",
        "Give a short metaphor for time.",
        "List 3 emerging trends in ML.",
        "Write a short limerick about programming.",
        "Explain supervised vs unsupervised learning in one sentence.",
        "List 3 ways to reduce urban traffic."
    ]
    start = time.time()
    results = []
    for prompt in prompts:
        results.append(ask_llm(prompt))
    end = time.time()
    for i, res in enumerate(results, 1):
        print(f"\n--- Response {i} ---")
        print(res)
    print(f"\n[Synchronous] Finished in {end - start:.2f} seconds")

if __name__ == "__main__":
    main()
The synchronous version took significantly longer, processing all 15 prompts sequentially. In contrast, the asynchronous version processes all prompts concurrently, drastically reducing the total runtime.
from openai import AsyncOpenAI

client = AsyncOpenAI()

async def ask_llm(prompt: str):
    response = await client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": prompt}]
    )
    return response.choices[0].message.content

async def main():
    prompts = [
        "Briefly explain quantum computing.",
        "Write a 3-line haiku about AI.",
        "List 3 startup ideas in agri-tech.",
        "Summarize Inception in 2 sentences.",
        "Explain blockchain in 2 sentences.",
        "Write a 3-line story about a robot.",
        "List 5 ways AI helps healthcare.",
        "Explain Higgs boson in simple terms.",
        "Describe neural networks in 2 sentences.",
        "List 5 blog post ideas on renewable energy.",
        "Give a short metaphor for time.",
        "List 3 emerging trends in ML.",
        "Write a short limerick about programming.",
        "Explain supervised vs unsupervised learning in one sentence.",
        "List 3 ways to reduce urban traffic."
    ]
    start = time.time()
    results = await asyncio.gather(*(ask_llm(p) for p in prompts))
    end = time.time()
    for i, res in enumerate(results, 1):
        print(f"\n--- Response {i} ---")
        print(res)
    print(f"\n[Asynchronous] Finished in {end - start:.2f} seconds")

if __name__ == "__main__":
    asyncio.run(main())
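One caveat: gather holds every result until the slowest prompt finishes. If you would rather handle each response the moment it arrives, for example to show users partial progress, asyncio.as_completed is a handy alternative. A short sketch reusing the async ask_llm above (main_as_completed is just an illustrative name):

async def main_as_completed():
    prompts = [
        "Briefly explain quantum computing.",
        "Write a 3-line haiku about AI.",
        "List 3 startup ideas in agri-tech.",
    ]
    # as_completed yields awaitables in the order they finish,
    # not the order they were submitted
    for coro in asyncio.as_completed([ask_llm(p) for p in prompts]):
        result = await coro
        print("\n--- Response ready ---")
        print(result)

asyncio.run(main_as_completed())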
Why This Matters in AI Applications
In real-world AI applications, waiting for each request to finish can become a significant bottleneck, especially when dealing with multiple queries or data sources. This is particularly common in:
- Generating content for multiple users simultaneously—like chatbots or recommendation engines.
- Calling the LLM several times in one workflow, for tasks like summarization or multi-step reasoning (see the sketch after this list).
- Fetching data from multiple APIs—combining LLM output with external information.
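To make the workflow pattern concrete, here is a minimal sketch of a fan-out/fan-in summarization step. It assumes the async ask_llm coroutine from the previous section; summarize_documents is a hypothetical helper, not part of any library:

async def summarize_documents(documents):
    # fan out: summarize each document concurrently
    summaries = await asyncio.gather(
        *(ask_llm(f"Summarize in 2 sentences:\n{doc}") for doc in documents)
    )
    # fan in: merge the partial summaries with one final call
    combined = "\n".join(summaries)
    return await ask_llm(f"Merge these summaries into one paragraph:\n{combined}")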
Using asyncio can lead to:
- Improved performance: Parallel API calls reduce overall execution time.
- Cost efficiency: Faster execution can lower operational costs.
- Better user experience: Concurrency enhances responsiveness in real-time systems.
- Scalability: Asynchronous patterns allow handling more simultaneous requests without a proportional increase in resource consumption.
In conclusion, integrating asyncio into your AI applications can significantly enhance performance, efficiency, and user experience. By leveraging asynchronous programming, developers can make the most of their resources and build more responsive applications.
FAQ
- What is asyncio?
  Asyncio is a Python library for writing concurrent code using the async/await syntax, which allows efficient handling of I/O-bound tasks.
- How does asyncio improve performance?
  By allowing multiple tasks to run concurrently, asyncio reduces the total waiting time for I/O operations, making applications faster.
- When should I use asyncio?
  Use asyncio when your application involves many I/O-bound tasks, such as API calls or file downloads, where waiting time can be overlapped.
- Can asyncio be used with LLMs?
  Yes, asyncio can significantly improve the performance of applications that make multiple API calls to LLMs by processing requests concurrently.
- What are some common mistakes when using asyncio?
  Common mistakes include not using await with async functions, blocking the event loop with synchronous code, and not handling exceptions properly.
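To make those last pitfalls concrete, here is a small sketch of two standard remedies: asyncio.to_thread moves a blocking call onto a worker thread so it cannot stall the event loop, and return_exceptions=True keeps one failed task from discarding everyone else's results. blocking_work and might_fail are illustrative names:

import asyncio
import time

def blocking_work():
    time.sleep(1)  # a synchronous call that would otherwise freeze the event loop
    return "done"

async def might_fail(i: int):
    if i == 2:
        raise ValueError(f"task {i} failed")
    await asyncio.sleep(0.1)
    return f"task {i} ok"

async def main():
    # run blocking code in a worker thread instead of on the event loop
    print(await asyncio.to_thread(blocking_work))
    # return_exceptions=True returns exceptions as results instead of
    # cancelling the whole batch on the first failure
    results = await asyncio.gather(*(might_fail(i) for i in range(4)),
                                   return_exceptions=True)
    for r in results:
        print(repr(r))

if __name__ == "__main__":
    asyncio.run(main())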