
LCORE-1216: Bump up to llama-stack 0.4.3#52

Open
are-ces wants to merge 2 commits into lightspeed-core:main from are-ces:llama-stack-0.4.x-bumpup

Conversation

@are-ces
Contributor

@are-ces are-ces commented Feb 8, 2026

Description

This is a significant refactoring of all the modules, mostly because the Agents API has been deprecated in favor of the Responses API in llama-stack (already from 0.3.x).

This upgrade is needed to keep lightspeed-providers on par with LCORE.

NOTE: run_moderation is designed to block a request outright, not to redact it. As a result, lightspeed-redactions will block the message entirely if an unauthorized string is detected, as opposed to run_shield, which can redact the original message.
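
The contrast can be sketched in Python. The result shapes below are simplified stand-ins invented for illustration, not the actual llama-stack response types:

```python
# Illustrative-only result shapes: a shield check may carry a rewritten
# (redacted) message, while a moderation check can only flag the input.
from dataclasses import dataclass
from typing import Optional


@dataclass
class ShieldResult:
    violation: Optional[str]         # None when the message is clean
    redacted_message: Optional[str]  # shield may rewrite the message


@dataclass
class ModerationResult:
    flagged: bool                    # moderation can only block


def apply_shield(message: str, result: ShieldResult) -> str:
    # run_shield can redact: prefer the rewritten message when present
    if result.redacted_message is not None:
        return result.redacted_message
    if result.violation is not None:
        raise ValueError(f"blocked: {result.violation}")
    return message


def apply_moderation(message: str, result: ModerationResult) -> str:
    # run_moderation is all-or-nothing: block or pass the message unchanged
    if result.flagged:
        raise ValueError("blocked by moderation")
    return message
```

This is why lightspeed-redactions behaves differently on the two paths: there is no rewritten message to fall back to on the moderation path.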

Changes:

  • Bump up llama-stack library to 0.4.3
  • Refactor agent code to migrate from Agents API to Responses API
  • Refactor run_shield in the safety module; add run_moderation
  • Keep temperature override, prioritization of most recently used tools, and tool filtering
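
As a rough sketch of what the Agents-to-Responses migration looks like from the caller's side (the `responses.create` shape follows the OpenAI-compatible Responses API that llama-stack 0.4.x exposes; the exact parameters here are an assumption, not code from this PR):

```python
# Hypothetical one-shot query through the Responses API instead of an
# Agent turn; `client` is assumed to expose `responses.create(...)`.
def ask(client, model, question, temperature=None):
    kwargs = {"model": model, "input": question}
    if temperature is not None:
        # mirrors the temperature override kept by this PR
        kwargs["temperature"] = temperature
    return client.responses.create(**kwargs)
```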

Type of change

  • Refactor
  • New feature
  • Bug fix
  • CVE fix
  • Optimization
  • Documentation Update
  • Configuration Update
  • Bump-up service version
  • Bump-up dependent library
  • Bump-up library or tool used for development (does not change the final image)
  • CI configuration change
  • Unit tests improvement

Tools used to create PR

Identify any AI code assistants used in this PR (for transparency and review context)

  • Partially generated by: Claude

Related Tickets & Documents

  • Related Issue # LCORE-1216
  • Closes # LCORE-1216

Checklist before requesting a review

  • I have performed a self-review of my code.
  • PR has passed all pre-merge test jobs.
  • If it is a core feature, I have added thorough tests.

Testing

I tested the following manually via curl requests:

  • Question validity run_shield (valid/invalid questions)
  • Question validity run_moderation
  • Redaction run_shield (sensitive data redacted)
  • Redaction run_moderation (message with sensitive data BLOCKED)
  • Tool filtering (11→1 tools)
  • min_tools threshold
  • Previously called tools persistence
  • always_include_tools config
  • Temperature override (1.0 for GPT-5)

@are-ces are-ces marked this pull request as draft February 8, 2026 16:53
@are-ces are-ces force-pushed the llama-stack-0.4.x-bumpup branch 3 times, most recently from b2b25c6 to c84a80e Compare February 8, 2026 17:25
Contributor

@tisnik tisnik left a comment


I'd say LGTM on my side, but we definitely need at least one more reviewer, especially from the teams that use the provider(s).

Contributor


You are removing the inline::lightspeed_inline_agent we are using in the Ansible Lightspeed chatbot; if this PR is merged, it will break the chatbot functionality.

Contributor Author


inline::lightspeed_inline_agent still works; the logic has been moved from agent_instance.py to agents.py.

@are-ces are-ces force-pushed the llama-stack-0.4.x-bumpup branch 3 times, most recently from 218e6d4 to 3ad6905 Compare February 10, 2026 11:29
@TamiTakamiya

@are-ces @ldjebran I could run the updated lightspeed_inline_agent with ansible-chatbot-stack. The test setup uses:

The setup is somewhat complicated because it uses a number of changes that are not merged to main yet. I will create a memo on my test setup.

Note: My setup does not enable an MCP server yet. After writing the memo, I plan to test this with the MCP server enabled.

@are-ces are-ces force-pushed the llama-stack-0.4.x-bumpup branch from 3ad6905 to f99d3c1 Compare February 11, 2026 08:31
@are-ces are-ces marked this pull request as ready for review February 11, 2026 08:32
"llama-stack==0.2.22",
"llama-stack-client==0.2.22",
"llama-stack==0.4.3",
"llama-stack-api==0.4.4",
Contributor


Why is it not the same version (0.4.3)? Is this intentional?


I tried to use llama-stack-api 0.4.3 for ansible-chatbot-stack and it did not work. I think 0.4.3 is broken.

Contributor Author

@are-ces are-ces Feb 11, 2026


Yes; we updated LCORE to pin the api package to 0.4.4 because of a CVE (v0.4.4 shouldn't have breaking changes).

Contributor

@Jdubrick Jdubrick left a comment


@are-ces since we only consume the safety shield portion for my use case, that part LGTM, FYI.

@ldjebran
Contributor

@are-ces it seems the file https://github.com/lightspeed-core/lightspeed-providers/blob/main/resources/external_providers/inline/agents/lightspeed_inline_agent.yaml

needs to be updated to:

config_class: lightspeed_stack_providers.providers.inline.agents.lightspeed_inline_agent.config.LightspeedAgentsImplConfig
module: lightspeed_stack_providers.providers.inline.agents.lightspeed_inline_agent
api_dependencies: [ inference, safety, tool_runtime, tool_groups, conversations, prompts ]
optional_api_dependencies: [vector_io, files]

The agent lightspeed_inline_agent is passing the queries through and overriding the temperature when configured. Unfortunately I was not able to test MCP filtering, as it seems lightspeed-stack has a regression: it is not passing the MCP headers received from the client via the MCP-HEADERS header.

There is a lot of work done here, @are-ces, many thanks for your efforts.
Can we wait a little before merging, to see the team's comments about the MCP headers?

Contributor

@ldjebran ldjebran left a comment


@are-ces many thanks for the work. The changes I proposed in my last comment are still valid. I tested MCP, but it seems lightspeed_inline_agent is unfortunately not working as expected and breaks when the MCP configuration is enabled: I see MCP returning the list of tools, but the agent does not seem to detect those tools and sees only 2 instead of more than 300.
This will need more investigation.

@are-ces are-ces force-pushed the llama-stack-0.4.x-bumpup branch from 84d4bf7 to 622151e Compare February 12, 2026 11:08
@are-ces
Contributor Author

are-ces commented Feb 12, 2026

Hey @ldjebran, good catch! I had encountered the same problem: I was handling the tools the wrong way. The MCP servers were not being expanded into their tools, so we were counting the MCP servers themselves and comparing that count with min_tools.
I have tested it on my side and it works as expected; hopefully the same on your side 😄
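
The fix can be sketched as follows. The entry shapes and the `list_server_tools` helper are hypothetical, not this PR's actual code, but they show the difference between counting MCP servers and counting their expanded tools:

```python
# Count individual tools, expanding each MCP server entry into the tools
# it serves instead of counting the server itself as one "tool".
def count_available_tools(entries, list_server_tools):
    tools = []
    for entry in entries:
        if entry.get("type") == "mcp_server":
            # the bug: counting this entry as a single tool;
            # the fix: expand the server into its actual tool list
            tools.extend(list_server_tools(entry["url"]))
        else:
            tools.append(entry)
    return len(tools)
```

A min_tools comparison then operates on the expanded count, so a single MCP server exposing hundreds of tools is no longer treated as one tool.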
