ConvFinQA Task

This repository implements a financial conversation QA agent using the ConvFinQA dataset.

Main components:

Report: A markdown file containing a written report of the task: methodology, outcomes, assessment and next actions.
Agent: A simple agent which uses the OpenAI agents SDK.
Evals: Evaluation metrics for agent performance compared to dataset ground truth.
Models: Data models for parsing dataset and storing agent responses.
Main: Main script to run the agent and evaluation.
Parsing: Data parsing script to convert ConvFinQA dataset into an easily usable format for prompting and evaluation.

Getting Started

Requirements

A funded OpenAI API Key
Python 3.10+ and pip

Quickstart (Local)

Set your OpenAI API key in the .env file, or export as an environment variable in your terminal, as OPENAI_API_KEY

Install dependencies:

python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Run the main script:

python -m app.main --mode tiny   # For a quick test
python -m app.main --mode test   # For full evaluation

Arguments

--mode: Either tiny (quick test), test (run test set), or full (run full dataset). Default: tiny.

Output

Evaluation metrics are printed to the terminal.
Agent responses are saved to data/responses.csv.
Parsed data for future use is saved to data/formatted_dataset in train/test/val splits and the full dataset.

Project Structure

app/ - Main application code with agent, evals, models, and main.
data/ - Datasets - Only the original dataset - running the script will generate formatted data and responses. Our formatted dataset will save to data/formatted_dataset with test, train, validate sets and a full unsplit dataset. Our responses will save to the main /data dir as responses.csv

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
app		app
data		data
.gitignore		.gitignore
README.md		README.md
report.md		report.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ConvFinQA Task

Main components:

Getting Started

Requirements

Quickstart (Local)

Arguments

Output

Project Structure

About

Uh oh!

Releases

Packages

Languages

perelloliver/convfinqa-task

Folders and files

Latest commit

History

Repository files navigation

ConvFinQA Task

Main components:

Getting Started

Requirements

Quickstart (Local)

Arguments

Output

Project Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages