⚡ AutoEDA - Automated Data Preprocessing Toolkit

⚡ Automate tedious data cleaning — focus more on insights, not pipelines.

📌 Table of Contents

🔍 Overview
✨ Key Features
🧱 Project Architecture
📦 Requirements
🚀 Getting Started
🤝 How to Contribute
🛡 License
📬 Contact

🔍 Overview

AutoEDA is a lightweight yet powerful toolkit that streamlines data preprocessing for Exploratory Data Analysis (EDA) and Machine Learning.

It automates routine cleaning tasks such as missing value treatment, type correction, and feature engineering, helping data scientists and analysts unlock insights faster and with less friction.

✨ Key Features

✅ Seamless CSV upload & schema validation
✅ Null value imputation & type inference
✅ Smart duplicate detection & cleanup
✅ Feature extraction and transformation
✅ REST API support for integration
✅ Modern React + Vite frontend
✅ Dockerized deployment for easy setup

🧱 Project Architecture

🧠 Backend (Python)

Modular design for cleaning, transforming, and preprocessing datasets
Built with FastAPI for high-speed async REST APIs
Easily extendable for custom ML workflows

🎨 Frontend (React + Vite)

Sleek UI for uploading, previewing, and processing datasets
Designed for responsiveness and ease-of-use
Optional sections for documentation, help, and dataset history

🐳 Docker Support

Docker & Docker Compose configuration for cross-platform deployment
One command to launch the full app stack

📦 Requirements

🧑‍💻 Frontend: React.js, Vite
🐍 Backend: Python 3.x, FastAPI, Pandas
🐳 Containerization: Docker & Docker Compose

⚠️ Remember to configure your .gitignore and environment variables!

🚀 Getting Started

1️⃣ Clone the repository

git clone https://github.com/Nidhi-Satyapriya/AutoEDA-Automated-Data-Preprocessing-Toolkit
cd AutoEDA-Automated-Data-Preprocessing-Toolkit

2️⃣ Run the backend

cd backend
pip install -r requirements.txt
uvicorn main:app --reload

3️⃣ Launch the frontend

cd frontend
npm install
npm run dev

4️⃣ [Optional] Run everything with Docker

docker-compose up --build

🤝 How to Contribute

We 💖 community contributions! Here’s how you can make an impact:

🔧 Frontend

🧪 Model Pipeline

⚙️ Backend

📢 New to open source? Start here: CONTRIBUTING.md

✨ Looking for ideas? Explore Good First Issues

🛡 License

This project uses a Modified MIT License.

🔒 Please read the LICENSE file carefully before using or contributing.

📬 Contact

We'd love to hear from you!

🗨 Open an issue
🔁 Submit a pull request
⭐ Star the repo if you find it helpful

Built with ❤️ by passionate developers — for the community, by the community.

✨ If you found this project useful...

Please consider ⭐ starring the repository and sharing it with your team or on social media.

"Keep pushing boundaries — even small steps can lead to powerful transformations. 🌱"

"Believe in the process, trust your curiosity, and let every dataset take you one step closer to mastery. 💡📊"

Name		Name	Last commit message	Last commit date
Latest commit History 94 Commits
assets		assets
autoeda		autoeda
backend		backend
frontend		frontend
notebooks		notebooks
unit_tests		unit_tests
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
autoEDA_outlier_report.json		autoEDA_outlier_report.json
autoEDA_outlier_summary.csv		autoEDA_outlier_summary.csv
autoEDA_outliers_capped.csv		autoEDA_outliers_capped.csv
autoEDA_outliers_flagged.csv		autoEDA_outliers_flagged.csv
autoEDA_outliers_removed.csv		autoEDA_outliers_removed.csv
docker-compose.yml		docker-compose.yml
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

⚡ AutoEDA - Automated Data Preprocessing Toolkit

📌 Table of Contents

🔍 Overview

✨ Key Features

🧱 Project Architecture

🧠 Backend (Python)

🎨 Frontend (React + Vite)

🐳 Docker Support

📦 Requirements

🚀 Getting Started

🤝 How to Contribute

🔧 Frontend

🧪 Model Pipeline

⚙️ Backend

🛡 License

📬 Contact

✨ If you found this project useful...

About

Uh oh!

Releases

Packages

Languages

License

Siddiha/AutoEDA-Automated-Data-Preprocessing-Toolkit

Folders and files

Latest commit

History

Repository files navigation

⚡ AutoEDA - Automated Data Preprocessing Toolkit

📌 Table of Contents

🔍 Overview

✨ Key Features

🧱 Project Architecture

🧠 Backend (Python)

🎨 Frontend (React + Vite)

🐳 Docker Support

📦 Requirements

🚀 Getting Started

🤝 How to Contribute

🔧 Frontend

🧪 Model Pipeline

⚙️ Backend

🛡 License

📬 Contact

✨ If you found this project useful...

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages