Skip to content
View fernando-neto-ai's full-sized avatar

Block or report fernando-neto-ai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. ReCall ReCall Public

    Forked from Agent-RL/ReCall

    ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

    Python

  2. vllm-fork vllm-fork Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  3. one-shot-em one-shot-em Public

    Forked from UbiquantAI/one-shot-em

    One-shot Entropy Minimization

    Python

  4. verl-tool verl-tool Public

    Forked from TIGER-AI-Lab/verl-tool

    A version of verl to support diverse tool use

    Python

  5. astra astra Public

    Forked from LianjiaTech/astra

    ASTRA is an end-to-end system for synthesizing agentic trajectories and rule-verifiable environments for SFT and RL training, developed by Beike Language and Intelligence (BLI).

    Python

  6. maxrl maxrl Public

    Forked from tajwarfahim/maxrl

    Official Implementation of "Maximum Likelihood Reinforcement Learning (MaxRL)"

    Python