fix: Fallback for OpenAI model #670
Conversation
CLA Verification: Passed. All contributors have signed the CLA.
Summary of Changes

Hello @Sahilbhatane, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed. This pull request restores and significantly improves the OpenAI integration within the LLM routing system. It introduces the ability to configure and utilize multiple OpenAI API keys, enhancing the system's resilience by allowing it to automatically fall back to alternative keys or providers if an initial request fails. This change ensures more robust and flexible operation when leveraging OpenAI models.

Highlights
📝 Walkthrough

Adds OpenAI as a first-class LLM provider: multi-key API-key parsing/rotation, provider detection via environment, sync/async OpenAI client initialization, completion paths with multi-key retry, token/cost accounting, provider-failure logging, routing/fallback integration, and tests for OpenAI flows.

Changes
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant Client
    participant LLMRouter
    participant KeyLoader as Key Loader
    participant OpenAI_API as OpenAI API

    Client->>LLMRouter: complete(messages)
    LLMRouter->>KeyLoader: _load_openai_keys()
    KeyLoader-->>LLMRouter: [api_key_1, api_key_2, ...]

    loop For each loaded API key
        LLMRouter->>OpenAI_API: POST /chat/completions (using api_key_n)
        alt Request succeeds
            OpenAI_API-->>LLMRouter: response + token usage
            LLMRouter->>LLMRouter: track tokens & cost
            LLMRouter-->>Client: LLMResponse (provider=OPENAI)
        else Request fails
            OpenAI_API-->>LLMRouter: error
            LLMRouter->>LLMRouter: _log_provider_failure, try next key
        end
    end

    alt All keys exhausted
        LLMRouter-->>Client: raise informative error / try fallback provider
    end
```
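To make the loop in the diagram concrete, here is a minimal Python sketch of the rotation flow. It is simplified and is not the PR's actual `_complete_openai`; the `gpt-4o-mini` model is just the default mentioned later in the review.

```python
from openai import OpenAI

def complete_with_rotation(api_keys: list[str], messages: list[dict]) -> str:
    """Try each API key in order; return content from the first success."""
    last_error: Exception | None = None
    for index, key in enumerate(api_keys):
        try:
            client = OpenAI(api_key=key)
            response = client.chat.completions.create(
                model="gpt-4o-mini", messages=messages
            )
            return response.choices[0].message.content or ""
        except Exception as exc:  # the review below suggests narrowing this
            last_error = exc
            if index < len(api_keys) - 1:
                continue  # rotate to the next key
    # All keys exhausted: surface the last error so callers can fall back.
    raise RuntimeError("All OpenAI API keys failed") from last_error
```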
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~50 minutes

Possibly related PRs

Suggested reviewers

Poem
🚥 Pre-merge checks: ❌ 3 failed checks (2 warnings, 1 inconclusive)
Code Review
This pull request successfully integrates OpenAI as a new LLM provider across the cortex/cli.py, cortex/llm_router.py, and cortex/predictive_prevention.py files. The changes include adding OpenAI to the LLMProvider enum, updating routing and fallback logic, implementing synchronous and asynchronous API completion methods, and enhancing API key management to support multiple OpenAI keys. The logging for provider failures has also been standardized to use logger.warning and logger.debug for consistency. Overall, the implementation is robust and aligns well with the existing architecture.
Actionable comments posted: 2
🤖 Fix all issues with AI agents
In `@cortex/llm_router.py`:
- Around line 138-145: Update the docstring in llm_router (constructor or
function where Args are listed) to mention that OpenAI keys can be provided via
both OPENAI_API_KEY and OPENAI_API_KEYS environment variables and that
_load_openai_keys() accepts multiple keys separated by commas, semicolons, or
spaces; explicitly note multi-key support and the accepted delimiters for
openai_api_key/openai_api_keys and that the loader falls back to single-key env
var if needed.
- Around line 538-592: Add a new test class TestOpenAIIntegration that mirrors
existing provider tests and covers the sync method _complete_openai and async
_acomplete_openai, including successful completion, multi-key fallback when
openai_api_keys contains invalid then valid keys, and final error propagation
when all keys fail; specifically mock the OpenAI client/
OpenAI.chat.completions.create to return a fake response (with
usage.prompt_tokens and usage.completion_tokens and choices[0].message.content)
and to raise exceptions for the first key(s) to exercise the key-rotation loop,
assert returned LLMResponse fields (content, provider == LLMProvider.OPENAI,
tokens_used, cost_usd, raw_response) and latency handling, and add a test that
verifies RuntimeError is raised when all keys fail (ensuring last_error is
chained).
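A hedged sketch of what such a test could look like follows. The patch target `cortex.llm_router.OpenAI`, the `LLMRouter` constructor argument, and the `_complete_openai` signature are assumptions based on the review text, not verified against the PR.

```python
from unittest.mock import MagicMock, patch

from cortex.llm_router import LLMRouter  # assumed import path

def make_fake_response(content="ok", prompt_tokens=10, completion_tokens=5):
    """Build a minimal object shaped like an OpenAI chat completion."""
    response = MagicMock()
    response.choices = [MagicMock()]
    response.choices[0].message.content = content
    response.usage.prompt_tokens = prompt_tokens
    response.usage.completion_tokens = completion_tokens
    return response

def test_openai_multi_key_fallback():
    fake = make_fake_response("hello")
    with patch("cortex.llm_router.OpenAI") as mock_cls:
        # First key fails, second succeeds -> exercises the rotation loop.
        bad, good = MagicMock(), MagicMock()
        bad.chat.completions.create.side_effect = Exception("invalid key")
        good.chat.completions.create.return_value = fake
        mock_cls.side_effect = [bad, good]

        router = LLMRouter(openai_api_keys=["k1", "k2"])  # assumed constructor
        result = router._complete_openai(
            messages=[{"role": "user", "content": "hi"}]
        )

    assert result.content == "hello"
```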
Actionable comments posted: 1
🤖 Fix all issues with AI agents
In `@cortex/llm_router.py`:
- Around line 591-605: In _openai_response_to_llmresponse guard against an empty
response.choices by checking that response.choices exists and has at least one
item before accessing choices[0]; if it's empty, return a safe LLMResponse
(e.g., content="", provider=LLMProvider.OPENAI, model=self.openai_model,
tokens_used=0, cost_usd=0.0, latency_seconds=0.0) and include the raw_response
if available, or alternatively raise a descriptive exception — ensure you also
safely access message.content (e.g., check for .message) when building the
content.
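A hedged sketch of the safe-return variant described above; the `LLMResponse` field names follow this comment, but the exact constructor is an assumption.

```python
from typing import Any

# Inside the router class; a sketch, not the PR's actual implementation.
def _openai_response_to_llmresponse(self, response: Any) -> "LLMResponse":
    if not getattr(response, "choices", None):
        # Empty or missing choices: return a safe, zeroed response instead
        # of letting choices[0] raise an IndexError.
        return LLMResponse(
            content="",
            provider=LLMProvider.OPENAI,
            model=self.openai_model,
            tokens_used=0,
            cost_usd=0.0,
            latency_seconds=0.0,
            raw_response=response,
        )
    message = response.choices[0].message
    content = getattr(message, "content", None) or ""
    # ... build the normal LLMResponse from `content` and `response.usage` ...
```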
♻️ Duplicate comments (2)
cortex/llm_router.py (2)
138-148: Update docstring to reflect multi-key and delimiter support. The docstring only mentions `OPENAI_API_KEY`, but the implementation loads from both `OPENAI_API_KEY` and `OPENAI_API_KEYS` environment variables and supports comma/space/semicolon-delimited lists.
538-592: Test coverage for OpenAI integration is still needed. As noted in a previous review, the new `_complete_openai` and `_acomplete_openai` methods with multi-key retry logic need dedicated test coverage to meet the >80% test coverage requirement.
🧹 Nitpick comments (4)
cortex/llm_router.py (4)
228-248: LGTM with optional docstring suggestion. The key loading logic correctly handles multiple sources, deduplication, and various delimiters. Consider adding brief docstrings for maintainability.
📝 Optional: Add docstrings
```diff
 @staticmethod
 def _split_api_keys(value: str | None) -> list[str]:
+    """Split a string of API keys by whitespace, comma, or semicolon."""
     if not value:
         return []
     return [key.strip() for key in re.split(r"[\s,;]+", value) if key.strip()]

 def _load_openai_keys(self, explicit_key: str | None) -> list[str]:
+    """Load OpenAI API keys from explicit input and environment variables, deduplicating."""
     keys: list[str] = []
```
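As a quick sanity check, the splitting behavior from the diff can be exercised standalone; this snippet just mirrors the code above.

```python
import re

def _split_api_keys(value: str | None) -> list[str]:
    # Mirrors the implementation above: split on whitespace, commas, semicolons.
    if not value:
        return []
    return [key.strip() for key in re.split(r"[\s,;]+", value) if key.strip()]

# All delimiter styles yield the same key list:
assert _split_api_keys("sk-a, sk-b; sk-c") == ["sk-a", "sk-b", "sk-c"]
assert _split_api_keys("sk-a sk-b sk-c") == ["sk-a", "sk-b", "sk-c"]
assert _split_api_keys(None) == []
```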
6-9: Update module docstring to include OpenAI. The module docstring lists supported providers but doesn't mention OpenAI.
📝 Suggested update
```diff
 Supports:
 - Claude API (Anthropic) - Best for natural language, chat, requirement parsing
 - Kimi K2 API (Moonshot) - Best for system operations, debugging, tool use
+- OpenAI API - Flexible fallback with multi-key support
```
101-104: Update OpenAI cost values for accurate cost tracking. Currently OpenAI costs are set to 0.0, which means all OpenAI usage will report $0 in cost tracking. Since `gpt-4o-mini` is the default model with known pricing, and the codebase comment at line 91 states "update with actual pricing", these values should be updated. Other providers already have pricing populated.

💡 Suggested improvement
```diff
 LLMProvider.OPENAI: {
-    "input": 0.0,  # Unknown/varies by model
-    "output": 0.0,  # Unknown/varies by model
+    "input": 0.15,  # gpt-4o-mini: $0.15 per 1M input tokens
+    "output": 0.60,  # gpt-4o-mini: $0.60 per 1M output tokens
 },
```
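Assuming the router multiplies token counts by these per-1M rates (as the other providers' entries suggest), a worked example:

```python
# gpt-4o-mini example: 2,000 input tokens and 500 output tokens.
input_rate = 0.15 / 1_000_000   # USD per input token
output_rate = 0.60 / 1_000_000  # USD per output token

cost_usd = 2_000 * input_rate + 500 * output_rate
print(f"${cost_usd:.6f}")  # $0.000600
```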
538-569: Narrow the exception handling to only retry on authentication failures. The current implementation catches all exceptions and retries with the next key. However, only `AuthenticationError` (invalid/expired API key) is key-specific and warrants a retry with another key. `RateLimitError` (429) is enforced at the organization level and applies to all keys in the same account, so switching keys won't resolve it. Other errors, such as an invalid request format, should fail immediately rather than retry.

💡 Suggested improvement
```diff
+from openai import AuthenticationError
+
 def _complete_openai( ... ) -> LLMResponse:
     ...
     for index, key in enumerate(self.openai_api_keys):
         try:
             ...
-        except Exception as e:
+        except AuthenticationError as e:
             last_error = e
             if index < len(self.openai_api_keys) - 1:
                 logger.warning("OpenAI API key failed, trying next key")
                 logger.debug("OpenAI error: %s", e)
                 continue
             break
```

Note: the same pattern applies to the async `_acomplete_openai` method.
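For illustration, a hedged sketch of the async variant with the narrowed exception; `self.openai_api_keys`, `self.openai_model`, `logger`, and `_openai_response_to_llmresponse` are assumed from the review context.

```python
from openai import AsyncOpenAI, AuthenticationError

async def _acomplete_openai(self, messages: list[dict]) -> "LLMResponse":
    """Sketch: the same rotation loop as the sync path, using the async client."""
    last_error: Exception | None = None
    for index, key in enumerate(self.openai_api_keys):
        try:
            client = AsyncOpenAI(api_key=key)
            response = await client.chat.completions.create(
                model=self.openai_model, messages=messages
            )
            return self._openai_response_to_llmresponse(response)
        except AuthenticationError as e:  # only key-specific failures rotate keys
            last_error = e
            if index < len(self.openai_api_keys) - 1:
                logger.warning("OpenAI API key failed, trying next key")
                logger.debug("OpenAI error: %s", e)
                continue
            break
    raise RuntimeError("All OpenAI API keys failed") from last_error
```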
Actionable comments posted: 1
🤖 Fix all issues with AI agents
In `@cortex/llm_router.py`:
- Around line 237-244: The _load_openai_keys function currently extends keys
with OPENAI_API_KEY before OPENAI_API_KEYS which contradicts the docstring that
says OPENAI_API_KEYS should be preferred; update _load_openai_keys so it calls
self._split_api_keys(os.getenv("OPENAI_API_KEYS")) before
self._split_api_keys(os.getenv("OPENAI_API_KEY")) (while still preserving
explicit_key handling and de-duplication), or alternatively update the docstring
to reflect the existing precedence—refer to _load_openai_keys, _split_api_keys,
explicit_key, OPENAI_API_KEY and OPENAI_API_KEYS to locate the change.
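The first option could look roughly like this; a sketch assuming the `_split_api_keys` helper shown earlier in the review:

```python
import os

def _load_openai_keys(self, explicit_key: str | None) -> list[str]:
    """Sketch: explicit key first, then OPENAI_API_KEYS, then OPENAI_API_KEY."""
    keys: list[str] = []
    if explicit_key:
        keys.extend(self._split_api_keys(explicit_key))
    # Preferred multi-key variable now checked before the single-key fallback.
    keys.extend(self._split_api_keys(os.getenv("OPENAI_API_KEYS")))
    keys.extend(self._split_api_keys(os.getenv("OPENAI_API_KEY")))
    # De-duplicate while preserving first-seen order.
    return list(dict.fromkeys(keys))
```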
♻️ Duplicate comments (1)
cortex/llm_router.py (1)
594-607: Guard against empty OpenAI choices to avoid IndexError. This still assumes `response.choices[0]` exists; a defensive check prevents unexpected crashes.

🛡️ Suggested guard
```diff
 def _openai_response_to_llmresponse(self, response: Any) -> LLMResponse:
-    content = response.choices[0].message.content or ""
+    if not getattr(response, "choices", None):
+        raise RuntimeError("OpenAI returned empty choices")
+    content = response.choices[0].message.content or ""
```



Related Issue
Closes #
Summary
llm_router.py previously had an OpenAI fallback, but a later commit removed it, and the OpenAI model was no longer referenced anywhere in cortex.
AI Disclosure
copilot 5.2-codex
Checklist
- Commit messages follow the `type(scope): description` or `[scope] description` format
- All tests pass (`pytest tests/`)

Screenshots
Before:

After:
Summary by CodeRabbit
New Features
Tests