Conversation
genzgd left a comment
This seems okay to me, although I can't claim to have done anything resembling a full review. A couple observations:
- I'm curious as to where the improvements come from over the existing implementation, so I'm looking forward to that blog post.
- There's a lot of duplicated code in the aiohttp_client. It would be nice to consolidate that somewhere.
- The piece with the async queue is hard to follow -- I don't know how feasible it is, but it would be nice to remove that layer and just use some kind of async-based generator without wrapping the extra queue.
Thanks @genzgd. To address your questions:
Yes, as I think about it, that makes sense. It might be theoretically possible to run the sync HTTP client (and the buffer) in a separate thread from the parser, gaining a similar benefit. On a related note, making the transform step truly parallel would be challenging given that HTTP chunks won't align with Native format blocks, but that's another argument in favor of a TCP protocol client. :)
Summary
This PR adds a native async HTTP client based on aiohttp (`AiohttpAsyncClient` in `clickhouse_connect/driver/aiohttp_client.py`) and wires it into the async client factory path. It replaces the default executor-wrapped sync client with true async I/O while keeping the public async API consistent with the existing `AsyncClient` surface. The legacy executor-wrapped path remains available (and is now explicitly deprecated) when passing a sync client.

Why this change
The previous async client was a thin wrapper around sync operations executed in a thread pool, which meant every request still performed blocking I/O on a pool thread rather than true async I/O. The new implementation performs HTTP I/O natively with aiohttp while preserving the established client API and behavior.

Key behavior and design points
Native async I/O with aiohttp
Requests use `aiohttp.ClientSession` with a configurable `TCPConnector` (pool limits, keepalive). HTTP response handling is fully async.
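As a rough illustration of that configuration surface (the parameter names and values below are aiohttp's own, not necessarily the options `AiohttpAsyncClient` exposes), a session might be set up along these lines:

```python
import aiohttp

async def make_session() -> aiohttp.ClientSession:
    # Connection pooling and keepalive are controlled by the TCPConnector.
    connector = aiohttp.TCPConnector(
        limit=32,               # total connection pool size
        limit_per_host=32,      # connections per ClickHouse host
        keepalive_timeout=60,   # seconds to keep idle connections open
    )
    # All request/response I/O on this session is awaitable.
    return aiohttp.ClientSession(
        connector=connector,
        timeout=aiohttp.ClientTimeout(total=300),
    )
```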
Streaming bridge for Native format

Native format parsing/serialization is still synchronous, CPU-bound work. The client uses a bounded queue as a sync/async bridge so async network reads/writes can overlap with sync parsing/serialization in an executor.
On the async query path (`StreamingResponseSource`), the async producer reads from the aiohttp response and the sync consumer parses in an executor. On the async insert path (`StreamingInsertSource`), the sync producer serializes in an executor and the async consumer streams to aiohttp. A minimal sketch of this producer/consumer pattern follows.
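For orientation only, here is a stripped-down sketch of the query-path idea: an async producer feeding a bounded queue while a sync consumer parses in an executor. The `parse_block` callback and the queue handling are illustrative assumptions; the actual `AsyncSyncQueue` in this PR adds cleanup and deadlock detection beyond what is shown.

```python
import asyncio
import aiohttp

async def stream_and_parse(session: aiohttp.ClientSession, url: str, parse_block):
    """Overlap async network reads with sync parsing in an executor (illustrative)."""
    loop = asyncio.get_running_loop()
    queue: asyncio.Queue = asyncio.Queue(maxsize=8)  # bounded: applies backpressure

    def sync_consumer():
        # Runs on an executor thread; pulls raw chunks off the asyncio queue
        # in a thread-safe way and hands them to the synchronous parser.
        while True:
            chunk = asyncio.run_coroutine_threadsafe(queue.get(), loop).result()
            if chunk is None:            # sentinel: producer is done
                return
            parse_block(chunk)           # sync, CPU-bound Native format parsing

    consumer = loop.run_in_executor(None, sync_consumer)

    async with session.get(url) as resp:
        async for chunk in resp.content.iter_chunked(64 * 1024):
            await queue.put(chunk)       # async producer: network read
    await queue.put(None)                # signal end of stream
    await consumer                       # wait for the parser to drain the queue
```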
Preventing event loop blocking

The client uses two complementary strategies to prevent users from accidentally blocking the event loop.
For non-streaming queries (`.query()`, `.query_df()`, etc.), results are fully materialized inside the executor before returning to the event loop. By the time a `QueryResult` is returned, all data is already in memory, so synchronous iteration is safe and won't cause deadlocks.

For streaming queries (`.query_rows_stream()`, `.query_df_stream()`, etc.), the `AsyncSyncQueue` bridge actively detects deadlock attempts. If you try to synchronously iterate a stream (`for row in stream`) from within an `async def` function, it raises a `ProgrammingError` immediately, prompting you to use `async for` instead.
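One way this kind of guard can be implemented -- shown purely as an illustration of the idea, not the actual `AsyncSyncQueue` code -- is to check for a running event loop before entering the blocking iteration path:

```python
import asyncio

class GuardedStream:
    """Illustrative only: detect a blocking `for` issued from the event loop thread."""

    def __init__(self, blocks):
        self._blocks = list(blocks)

    def __iter__(self):
        try:
            asyncio.get_running_loop()
        except RuntimeError:
            # No running loop on this thread (e.g. an executor thread): plain `for` is safe.
            return iter(self._blocks)
        # A loop is running on this thread, so blocking iteration would stall it.
        # The real client raises ProgrammingError here, pointing users at `async for`.
        raise RuntimeError("use 'async for' to consume this stream inside a coroutine")

    def __aiter__(self):
        return self._aiter()

    async def _aiter(self):
        for block in self._blocks:
            yield block
```

In user code the practical rule is simply to consume streams with `async for` inside coroutines; plain `for` remains appropriate only off the event loop.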
Backward compatibility

`AsyncClient(client=...)` still wraps the sync client in an executor, but emits a deprecation warning. The recommended path is `get_async_client(...)`, which now creates the aiohttp-based client.

Tests
Integration tests using `param_client` now exercise both sync and async clients, and new async-specific tests validate native async behaviors (concurrency, streaming cleanup, session protection, timeouts, etc.).
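A rough sketch of what an async concurrency test can look like (the `async_client` fixture name and pytest-asyncio usage are assumptions, not the PR's actual test code):

```python
import asyncio
import pytest

@pytest.mark.asyncio
async def test_concurrent_queries(async_client):
    async def one_query(i: int):
        result = await async_client.query(f'SELECT {i}')
        return result.result_rows[0][0]

    # Fire several queries concurrently; with native async I/O they overlap on one event loop.
    values = await asyncio.gather(*(one_query(i) for i in range(10)))
    assert sorted(values) == list(range(10))
```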
Migration and compatibility notes

- Recommended: `async_client = await clickhouse_connect.get_async_client(...)`
- Deprecated (still works, with a warning): `AsyncClient(client=sync_client)`
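A hedged end-to-end sketch of the recommended path (the connection parameters and the `close()` call are assumptions here, not taken from this PR's description):

```python
import asyncio
import clickhouse_connect

async def main():
    # get_async_client(...) now returns the aiohttp-based client.
    client = await clickhouse_connect.get_async_client(host='localhost', username='default')
    try:
        result = await client.query('SELECT number FROM system.numbers LIMIT 5')
        # Non-streaming results are fully materialized, so plain iteration is safe here.
        for row in result.result_rows:
            print(row)
    finally:
        await client.close()   # assumed cleanup call; releases the underlying aiohttp session

asyncio.run(main())
```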
Performance notes
A preliminary benchmark comparing the executor-based async client (as it exists in clickhouse-connect v0.10.0) against the new async-native client was performed. The setup was as follows:
The observed speedups of the new async client over the executor-based client ranged from 2% to 95%, with an average increase of around 40%, depending on the workload. P95 latencies showed marked improvement as well. A detailed design/benchmark blog post is planned and a link will be provided when done.
Checklist
Delete items not relevant to your PR: