Skip to content

perelloliver/convfinqa-task

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ConvFinQA Task

This repository implements a financial conversation QA agent using the ConvFinQA dataset.

Main components:

  • Report: A markdown file containing a written report of the task: methodology, outcomes, assessment and next actions.
  • Agent: A simple agent which uses the OpenAI agents SDK.
  • Evals: Evaluation metrics for agent performance compared to dataset ground truth.
  • Models: Data models for parsing dataset and storing agent responses.
  • Main: Main script to run the agent and evaluation.
  • Parsing: Data parsing script to convert ConvFinQA dataset into an easily usable format for prompting and evaluation.

Getting Started

Requirements

  • A funded OpenAI API Key
  • Python 3.10+ and pip

Quickstart (Local)

  1. Set your OpenAI API key in the .env file, or export as an environment variable in your terminal, as OPENAI_API_KEY

  2. Install dependencies:

    python -m venv venv
    source venv/bin/activate
    pip install -r requirements.txt
    
  3. Run the main script:

    python -m app.main --mode tiny   # For a quick test
    python -m app.main --mode test   # For full evaluation
    

Arguments

  • --mode: Either tiny (quick test), test (run test set), or full (run full dataset). Default: tiny.

Output

  • Evaluation metrics are printed to the terminal.
  • Agent responses are saved to data/responses.csv.
  • Parsed data for future use is saved to data/formatted_dataset in train/test/val splits and the full dataset.

Project Structure

  • app/ - Main application code with agent, evals, models, and main.
  • data/ - Datasets - Only the original dataset - running the script will generate formatted data and responses. Our formatted dataset will save to data/formatted_dataset with test, train, validate sets and a full unsplit dataset. Our responses will save to the main /data dir as responses.csv

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages