Skip to content
@Data-Centric-AI-Community

Data-Centric AI Community

The Data-Centric AI Community is the home of all things data. Help us achieve high-quality data for data science!

   

Welcome to the Data-Centric AI Community

We're a group of data science enthusiasts committed to developing AI and ML applications with a focus on data quality! We get together to learn, discuss, and collaborate on topics such as Data Quality, Data Profiling, and Synthetic Data, closing the gap between data understanding and improvement.

Start learning data science with beginner-friendly projects! 💻

  • 🐍 awesome-python-for-data-science: The roadmap to learn data science in 2023!
  • 📦 awesome-data-centric-ai: Stay up to data with the latest resources on Data-Centric AI
  • 👾 Join us on our Discord Server to meet and learn from other data enthusiasts!

You can follow our updates on the website blog or on Medium and subscribe to our newsletter to stay up to date with our events, data initiatives, and to receive monthly tricks and tips on how to achieve actionable data to feed your machine learning models.

📧 Feel free to get in touch! You can reach out to us at the Discord server. And good news, if you have questions about projects such as ydata-profiling or ydata-synthetic, you can find the right persons to help you in our community! 🥳

Popular repositories Loading

  1. ydata-profiling ydata-profiling Public

    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

    Python 13.4k 1.8k

  2. ydata-synthetic ydata-synthetic Public

    Synthetic data generators for tabular and time-series data

    Jupyter Notebook 1.6k 257

  3. ydata-quality ydata-quality Public

    Data Quality assessment with one line of code

    Jupyter Notebook 454 56

  4. awesome-data-centric-ai awesome-data-centric-ai Public

    Open-Source Software, Tutorials, and Research on Data-Centric AI 🤖

    Jupyter Notebook 345 47

  5. awesome-python-for-data-science awesome-python-for-data-science Public

    A curated list of awesome resources such as books, tutorials, courses, open-source libraries, exercises, and other materials that support Pythonistas in the making, and Pythonistas migrating into D…

    Jupyter Notebook 91 20

  6. nist-crc-2023 nist-crc-2023 Public

    NIST Collaborative Research Cycle on Synthetic Data. Learn about Synthetic Data week by week!

    Jupyter Notebook 27 2

Repositories

Showing 10 of 11 repositories

Top languages

Loading…

Most used topics

Loading…