article_extractor

Installation

Clone the repository:

https://github.com/stadiello/article_extractor

Navigate to the project directory:
```
cd article_extractor
```
Install dependencies using pyproject.toml:
```
pip install .
```
(BIS) Install dependencies using poetry:
```
poetry install
```

Usage

Launch the app :

streamlit run src/extractor/main.py

Connect at the url and follow the instructions.

Add or modify questions

You just have to add your new question on a new line in the questions.txt in the folder data.

Minimal config

AI Models Used

Ollama (in bot.py) with the model deepseek-r1:8b
Selenium for web scraping

Recommended Minimal Configuration

CPU:

Intel Core i5/AMD Ryzen 5 (minimum 4 cores)
2.5 GHz or higher

RAM:

Minimum recommended: 16 GB

Storage:

SSD with at least 20 GB of free space (for models and dependencies)

GPU:

Not mandatory but recommended
If using a GPU: NVIDIA with at least 4 GB VRAM
Without a GPU: The project will work but may run slower

Important Note:

The project can run without a GPU because:

Ollama can execute on a CPU
Streamlit and Selenium do not require a GPU However, using a GPU will significantly improve performance.

Supported Operating Systems:

macOS
Linux
Windows (WSL recommended)

Contact

For questions or support, please contact the development team at tadiello.sebastien@gmail.com.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
data		data
src/extractor		src/extractor
tests		tests
.gitignore		.gitignore
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

article_extractor

Installation

Usage

Add or modify questions

Minimal config

AI Models Used

Recommended Minimal Configuration

CPU:

RAM:

Storage:

GPU:

Important Note:

Supported Operating Systems:

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Languages

stadiello/article_extractor

Folders and files

Latest commit

History

Repository files navigation

article_extractor

Installation

Usage

Add or modify questions

Minimal config

AI Models Used

Recommended Minimal Configuration

CPU:

RAM:

Storage:

GPU:

Important Note:

Supported Operating Systems:

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages