This repository was archived by the owner on Feb 26, 2026. It is now read-only.

fix: prevent HTTP client connection leaks in embedders #917

Open
choutos wants to merge 1 commit into timescale:main from choutos:fix/openai-client-connection-leak

Conversation


@choutos choutos commented Feb 4, 2026

Fixes #919

Problem

The AsyncOpenAI, Ollama, and VoyageAI embedders create HTTP clients that are never explicitly closed. This causes connections to accumulate in CLOSE_WAIT state and eventually exhaust file descriptors.

Investigation Details

In production, we observed:

  • ~35,000 file descriptors held by pgai-vectorizer-worker
  • All were sockets in CLOSE_WAIT state
  • Connections were to the OpenAI API (via Cloudflare)
  • The systemd error: Failed to allocate manager object: Too many open files

Root Cause

The _embedder property in openai.py creates an AsyncOpenAI client that holds an httpx.AsyncClient. When the embedder is garbage collected, the HTTP client's connections are not properly closed, leaving them in CLOSE_WAIT until the OS times them out.
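A minimal sketch of that pattern (`FakeAsyncClient` and `LeakyEmbedder` are illustrative stand-ins, not the actual pgai or httpx classes): every property access builds a fresh client, and no code path ever closes the previous one.

```python
import asyncio

class FakeAsyncClient:
    """Illustrative stand-in for httpx.AsyncClient (not the real library)."""
    instances_open = 0

    def __init__(self):
        # Each instance represents an open connection pool.
        FakeAsyncClient.instances_open += 1

class LeakyEmbedder:
    """Mirrors the problematic shape: a property that constructs a new
    client on every access and never closes the old one."""
    @property
    def _embedder(self) -> FakeAsyncClient:
        return FakeAsyncClient()

async def embed_batches(n: int) -> int:
    e = LeakyEmbedder()
    for _ in range(n):
        client = e._embedder  # each access opens a fresh client
        # ... requests would be issued via `client` here ...
    return FakeAsyncClient.instances_open

leaked = asyncio.run(embed_batches(3))
print(leaked)  # 3 clients created, none ever closed
```

Garbage collection eventually reclaims the Python objects, but the sockets they held linger in CLOSE_WAIT until the OS gives up on them.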

Similar patterns exist in ollama.py and voyageai.py.

Solution

  1. Add cleanup() method to the Embedder base class
  2. Implement cleanup() in OpenAI, Ollama, and VoyageAI embedders to close underlying HTTP clients
  3. Call cleanup() in Executor.run()'s finally block to ensure resources are released
  4. Reuse client instances instead of creating new ones for each request (Ollama, VoyageAI)
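The four steps above can be sketched roughly as follows (`FakeClient` and `run` are illustrative stand-ins for the real client and `Executor.run()`, not the actual pgai code):

```python
import asyncio

class FakeClient:
    """Illustrative stand-in for e.g. openai.AsyncOpenAI."""
    def __init__(self):
        self.closed = False

    async def close(self):
        self.closed = True

class Embedder:
    """Base class: cleanup() is a no-op unless a subclass holds resources."""
    async def cleanup(self) -> None:
        pass  # step 1: hook for subclasses

class OpenAIEmbedder(Embedder):
    def __init__(self):
        self._client = None

    def _get_client(self) -> FakeClient:
        # Step 4: reuse one client (one connection pool) across requests.
        if self._client is None:
            self._client = FakeClient()
        return self._client

    async def cleanup(self) -> None:
        # Step 2: explicitly close the underlying HTTP client.
        if self._client is not None:
            await self._client.close()
            self._client = None

async def run(embedder: OpenAIEmbedder) -> None:
    try:
        client = embedder._get_client()
        # ... embedding batches would be processed here ...
    finally:
        # Step 3: release resources no matter how the loop exits.
        await embedder.cleanup()

e = OpenAIEmbedder()
client = e._get_client()
asyncio.run(run(e))
print(client.closed)  # True
```

Putting the `cleanup()` call in the `finally` block means even a crash mid-batch still closes the pooled connections.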

Changes

  • embeddings.py: Add cleanup() method to base class
  • openai.py: Store client reference, implement cleanup()
  • ollama.py: Add _get_client() for client reuse, implement cleanup()
  • voyageai.py: Add _get_client() for client reuse, implement cleanup()
  • vectorizer.py: Call cleanup() in finally block

Testing

This fix was developed in response to a production issue. We recommend:

  • Unit tests for cleanup() methods
  • Integration test that verifies no file descriptor leak after multiple embedding batches
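A unit test for the first suggestion could look like this (a sketch: `StubEmbedder` and `FakeClient` are hypothetical stand-ins for the real embedders and their HTTP clients; the fd-leak integration test would additionally compare `/proc/<pid>/fd` counts before and after several batches):

```python
import asyncio

class FakeClient:
    """Stand-in for the Ollama/VoyageAI HTTP client (illustrative)."""
    def __init__(self):
        self.closed = False

    async def aclose(self):
        self.closed = True

class StubEmbedder:
    """Mirrors the _get_client()/cleanup() pattern described above."""
    def __init__(self):
        self._client = None

    def _get_client(self) -> FakeClient:
        if self._client is None:
            self._client = FakeClient()
        return self._client

    async def cleanup(self) -> None:
        if self._client is not None:
            await self._client.aclose()
            self._client = None

def test_cleanup_closes_reused_client():
    e = StubEmbedder()
    first = e._get_client()
    assert first is e._get_client()   # client is reused, not recreated
    asyncio.run(e.cleanup())
    assert first.closed               # underlying connections released
    assert e._client is None          # safe to lazily re-create later

test_cleanup_closes_reused_client()
print("ok")
```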

@choutos choutos requested a review from a team as a code owner February 4, 2026 17:19
@choutos choutos had a problem deploying to external-contributors February 4, 2026 17:19 — with GitHub Actions Error

CLAassistant commented Feb 4, 2026

CLA assistant check
All committers have signed the CLA.

The AsyncOpenAI, Ollama, and VoyageAI embedders were creating HTTP clients
that were never explicitly closed, causing connections to accumulate in
CLOSE_WAIT state and eventually exhausting file descriptors.

Changes:
- Add cleanup() method to Embedder base class
- Implement cleanup() in OpenAI, Ollama, and VoyageAI embedders to close
  underlying HTTP clients
- Call cleanup() in Executor.run() finally block to ensure resources are
  released regardless of how the embedding loop exits
- Reuse client instances instead of creating new ones for each request
  (Ollama, VoyageAI)

This fixes a file descriptor leak that could cause 'Too many open files'
errors after prolonged operation.


Development

Successfully merging this pull request may close these issues.

HTTP client connection leak in embedders causes file descriptor exhaustion
