asyncio Pitfalls: The 3-Hour Bug

Published: April 30, 2026 at 09:21 PM EDT
4 min read
Source: Dev.to

The Scenario: 200 API Data Fetches, from 40s to 2s

The original synchronous code looked like this — simple but painfully slow:

import time
import requests

def fetch_all(urls):
    results = []
    for url in urls:
        resp = requests.get(url, timeout=5)
        results.append(resp.json())
    return results

urls = [f"https://api.example.com/item/{i}" for i in range(200)]
start = time.time()
data = fetch_all(urls)
print(f"Elapsed: {time.time() - start:.2f}s")
# Output: Elapsed: 41.23s

200 sequential requests taking over 40 seconds — a terrible experience. Brimming with confidence, I set out to rewrite it with asyncio.

Core Concept: Event Loop + Coroutines = Non‑blocking Concurrency

asyncio works completely differently from multithreading. It runs a single‑threaded event loop where all coroutines are scheduled within the same thread. When a coroutine hits an I/O wait (network, disk), it voluntarily yields control back to the event loop via await. The loop immediately switches to other coroutines that are ready to run. This way, the CPU never idly spins waiting for I/O.
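
A toy sketch of that hand-off, using asyncio.sleep to stand in for a network wait (the names and delays are made up for the demo):

import asyncio

async def worker(name, delay):
    print(f"{name}: started")
    await asyncio.sleep(delay)  # the coroutine yields control to the event loop here
    print(f"{name}: done after {delay}s")

async def main():
    t1 = asyncio.create_task(worker("A", 2))
    t2 = asyncio.create_task(worker("B", 1))
    await t1
    await t2  # total wall time is ~2s, not 3s: the two waits overlap

asyncio.run(main())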

The most basic pattern:

import asyncio
import aiohttp

async def fetch(session, url):
    async with session.get(url) as resp:
        return await resp.json()

async def main():
    async with aiohttp.ClientSession() as session:
        tasks = [fetch(session, url) for url in urls]
        results = await asyncio.gather(*tasks)
    return results

urls = [f"https://api.example.com/item/{i}" for i in range(200)]
asyncio.run(main())

asyncio.gather schedules all coroutines concurrently, so the total time is roughly the duration of the slowest request, not the sum. In theory, wonderful — but as soon as I ran it, the pitfalls began.
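
You can verify the "slowest request wins" behavior without touching the network at all; a sketch with plain sleeps:

import asyncio
import time

async def job(delay):
    await asyncio.sleep(delay)
    return delay

async def main():
    start = time.time()
    results = await asyncio.gather(job(1), job(2), job(3))
    # gather preserves input order: [1, 2, 3] in ~3.00s, not 6s
    print(results, f"in {time.time() - start:.2f}s")

asyncio.run(main())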

Pitfall 1: Sneaky Synchronous Blocking Calls Inside Coroutines

At first, I took a shortcut and kept using requests.get inside the coroutine, thinking simply wrapping it in async def would work. The event loop got completely stuck on requests.get — concurrency went out the window.

# Anti-pattern: don't do this
import requests

async def fetch_bad(url):
    resp = requests.get(url)   # synchronous blocking! the event loop is stuck
    return resp.json()

The requests library performs synchronous I/O. Once called, the entire thread blocks waiting for the network response, and the event loop loses all control. For asyncio, you must use libraries that support async/await — like aiohttp for HTTP and aiomysql for database queries.
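
Here is a minimal way to watch the loop freeze, with time.sleep standing in for the blocking requests.get call:

import asyncio
import time

async def heartbeat():
    for _ in range(4):
        print("tick")
        await asyncio.sleep(0.5)

async def bad_fetch():
    time.sleep(2)  # blocks the whole thread; heartbeat() cannot run meanwhile

async def main():
    await asyncio.gather(heartbeat(), bad_fetch())

asyncio.run(main())
# output: one tick, a 2-second stall, then the remaining ticks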

Solution

Replace all I/O with libraries from the async/await ecosystem. For unavoidable synchronous code, offload it to a thread pool using loop.run_in_executor:

import concurrent.futures
import asyncio

def sync_heavy_work(data):
    # a synchronous, CPU-bound computation that cannot be rewritten as async
    return sum(i * i for i in range(data))

async def run_in_thread(data):
    loop = asyncio.get_running_loop()
    with concurrent.futures.ThreadPoolExecutor() as pool:
        result = await loop.run_in_executor(pool, sync_heavy_work, data)
    return result

This prevents blocking the event loop, but don’t overuse it — thread‑switching overhead still exists.
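
On Python 3.9 and newer, asyncio.to_thread is a shorter spelling of the same pattern; it hands the call to the loop's default thread pool (reusing sync_heavy_work from the block above):

import asyncio

async def run_in_thread(data):
    # equivalent to loop.run_in_executor(None, sync_heavy_work, data)
    return await asyncio.to_thread(sync_heavy_work, data)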

Pitfall 2: gather Raises on First Exception, Discarding Other Results

With 200 requests, a couple of timeouts or HTTP 500s are expected. The first time I used gather, a single task raising an exception made the entire gather throw immediately, discarding the 190+ successful responses.

results = await asyncio.gather(*tasks)  # one failure and all the other results are lost

Solution

gather has a return_exceptions=True parameter. Instead of raising, it places exception objects directly into the result list, so you can handle them like regular return values:

results = await asyncio.gather(*tasks, return_exceptions=True)

for i, res in enumerate(results):
    if isinstance(res, Exception):
        print(f"Task {i} failed: {res}")
    else:
        process(res)

This way, even if 10 tasks time out, you still get the data from the remaining 190. I also like to add timeout and retry logic inside fetch:

from aiohttp import ClientTimeout

async def fetch(session, url, retries=2):
    for attempt in range(retries):
        try:
            async with session.get(url, timeout=ClientTimeout(total=10)) as resp:
                return await resp.json()
        except Exception:
            if attempt == retries - 1:
                raise
            await asyncio.sleep(2 ** attempt)  # brief exponential backoff before retrying
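
Wired back into main, the shape from earlier stays the same; only return_exceptions=True is new (a sketch reusing the retrying fetch just defined and the same placeholder URLs as above):

import asyncio
import aiohttp

async def main(urls):
    async with aiohttp.ClientSession() as session:
        tasks = [fetch(session, url) for url in urls]
        # anything still failing after the retries shows up here as an exception object
        return await asyncio.gather(*tasks, return_exceptions=True)

urls = [f"https://api.example.com/item/{i}" for i in range(200)]
results = asyncio.run(main(urls))
ok = [r for r in results if not isinstance(r, Exception)]
print(f"{len(ok)}/{len(results)} requests succeeded")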