⚡ Automate tedious data cleaning — focus more on insights, not pipelines.
- 🔍 Overview
- ✨ Key Features
- 🧱 Project Architecture
- 📦 Requirements
- 🚀 Getting Started
- 🤝 How to Contribute
- 🛡 License
- 📬 Contact
AutoEDA is a lightweight yet powerful toolkit that streamlines data preprocessing for Exploratory Data Analysis (EDA) and Machine Learning.
It automates routine cleaning tasks such as missing value treatment, type correction, and feature engineering, helping data scientists and analysts unlock insights faster and with less friction.
- ✅ Seamless CSV upload & schema validation
- ✅ Null value imputation & type inference
- ✅ Smart duplicate detection & cleanup
- ✅ Feature extraction and transformation
- ✅ REST API support for integration
- ✅ Modern React + Vite frontend
- ✅ Dockerized deployment for easy setup
- Modular design for cleaning, transforming, and preprocessing datasets
- Built with FastAPI for high-speed async REST APIs
- Easily extendable for custom ML workflows
- Sleek UI for uploading, previewing, and processing datasets
- Designed for responsiveness and ease-of-use
- Optional sections for documentation, help, and dataset history
- Docker & Docker Compose configuration for cross-platform deployment
- One command to launch the full app stack
- 🧑💻 Frontend: React.js, Vite
- 🐍 Backend: Python 3.x, FastAPI, Pandas
- 🐳 Containerization: Docker & Docker Compose
⚠️ Remember to configure your.gitignoreand environment variables!
1️⃣ Clone the repository
git clone https://github.com/Nidhi-Satyapriya/AutoEDA-Automated-Data-Preprocessing-Toolkit
cd AutoEDA-Automated-Data-Preprocessing-Toolkit2️⃣ Run the backend
cd backend
pip install -r requirements.txt
uvicorn main:app --reload3️⃣ Launch the frontend
cd frontend
npm install
npm run dev4️⃣ [Optional] Run everything with Docker
docker-compose up --buildWe 💖 community contributions! Here’s how you can make an impact:
📢 New to open source? Start here: CONTRIBUTING.md
✨ Looking for ideas? Explore Good First Issues
This project uses a Modified MIT License.
🔒 Please read the LICENSE file carefully before using or contributing.
We'd love to hear from you!
- 🗨 Open an issue
- 🔁 Submit a pull request
- ⭐ Star the repo if you find it helpful
Built with ❤️ by passionate developers — for the community, by the community.
Please consider ⭐ starring the repository and sharing it with your team or on social media.
"Keep pushing boundaries — even small steps can lead to powerful transformations. 🌱"
"Believe in the process, trust your curiosity, and let every dataset take you one step closer to mastery. 💡📊"