
[MICCAI 2025] Multi-Agent Collaboration for Integrating Echocardiography Expertise in Multi-Modal Large Language Models

TODO

We will complete the items below in the coming weeks.

  • [ ] Release the pipeline code.

Required API Keys

The pipeline calls three external services, so you will need API keys for Mistral, Azure OpenAI, and Qwen (see Installation, step 3).

Overview

This automated pipeline converts echocardiography PDFs into a concept-indexed database (the EchoCardiography Expertise Database). It extracts:

  • Images with their captions
  • Plain-text content
  • Detected and split subfigures
  • Medical keywords for searchability
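For illustration, a single entry in the resulting database might look like the following. The field names here are hypothetical, not the repository's actual schema:

```python
# Hypothetical sketch of one concept-indexed entry produced by the pipeline.
# Field names are illustrative; the actual schema may differ.
entry = {
    "source": "echo_textbook.pdf",             # originating PDF
    "type": "image",                           # "image" or "text"
    "image_path": "final_output/fig_3_1a.png",
    "caption": "Apical four-chamber view of the left ventricle.",
    "keywords": ["apical four-chamber view", "left ventricle"],
}

# Keywords make entries retrievable by medical concept.
matches = [e for e in [entry] if "left ventricle" in e["keywords"]]
print(len(matches))  # → 1
```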

Knowledge Sources

Please check out the sheets here.

  • For books, we obtained either physical or digital copies (PDF versions) through our clinical collaborators.
  • For clinical guidelines, we accessed the PDF versions under an Open Access license via the institutional library using Elsevier's Text and Data Mining (TDM) services.

All resources used in this work were obtained and processed solely for non-commercial research purposes. We strongly encourage others to obtain these resources through their institutional libraries or through appropriate purchasing channels, and to use them in accordance with licensing terms and strictly for non-commercial research purposes.

Installation

1. Clone the Repository
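Assuming the standard GitHub URL for xmed-lab/ECED:

```shell
# Clone the repository and enter it (URL assumes the public GitHub hosting)
git clone https://github.com/xmed-lab/ECED.git
cd ECED
```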

2. Install Dependencies

pip install mistralai openai pillow pandas tqdm qwen-vl-utils

3. Create API Key Configuration

Create a key.py file in the project root:

MISTRAL_KEY = "your-mistral-api-key"
OPENAI_KEY = "your-azure-openai-api-key"
QWEN_KEY = "your-qwen-api-key"

⚠️ Security Note: Never commit key.py to version control. Add it to .gitignore.
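One way to keep the key file out of version control:

```shell
# Ignore key.py so the API keys are never committed
echo "key.py" >> .gitignore
```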


Directory Structure

release_v1/
├── step1-mistral_ocr.py
├── step2-rawContentSplit.py
├── step3-checkSplitSubcaption.py
├── step4-splitSubfigure.py
├── step5-cleanTextandKeywords.py
├── key.py (create this - not in repo)
├── Todo/
│   ├── ENG/
│   │   ├── Guideline/
│   │   └── Textbook/
│   └── CHN/
│       ├── Guideline/
│       └── Textbook/
├── Intermediate/
│   └── [Generated during processing]
└── final_output/
    └── [Final results with keywords]

Usage

Running the Complete Pipeline

Execute steps sequentially:

# Step 1: OCR Processing
python step1-mistral_ocr.py

# Step 2: Content Splitting
python step2-rawContentSplit.py

# Step 3: Subcaption Detection
python step3-checkSplitSubcaption.py

# Step 4: Subfigure Splitting
python step4-splitSubfigure.py

# Step 5: Keyword Extraction
python step5-cleanTextandKeywords.py
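The five steps above can also be chained in a small driver script — a sketch, assuming each step exits non-zero on failure:

```python
# Run the five pipeline stages in order, stopping at the first failure.
import subprocess
import sys

STEPS = [
    "step1-mistral_ocr.py",
    "step2-rawContentSplit.py",
    "step3-checkSplitSubcaption.py",
    "step4-splitSubfigure.py",
    "step5-cleanTextandKeywords.py",
]

def run_pipeline(steps=STEPS):
    for script in steps:
        # check=True raises CalledProcessError if a step exits non-zero
        subprocess.run([sys.executable, script], check=True)
```

Calling `run_pipeline()` from the repository root reproduces the sequential commands above.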

Input Requirements

Place PDF files in the appropriate directories:

  • English Guidelines: ./Todo/ENG/Guideline/
  • English Textbooks: ./Todo/ENG/Textbook/
  • Chinese Guidelines: ./Todo/CHN/Guideline/
  • Chinese Textbooks: ./Todo/CHN/Textbook/
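The four input directories can be created in one command:

```shell
# Create the expected input layout for all four source categories
mkdir -p Todo/ENG/Guideline Todo/ENG/Textbook Todo/CHN/Guideline Todo/CHN/Textbook
```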

Output Structure

Final outputs are stored in ./final_output/ and include:

  • Image-caption pairs with keywords
  • Text content with keywords
  • Split subfigure images
  • Recheck files for manual validation

License

MIT License. Copyright (c) 2025 The Hong Kong University of Science and Technology.


Contact

Contact yqinar@connect.ust.hk for any questions or feedback.
