Some basic machine learning topics for learning purposes
This repository contains a collection of Jupyter notebooks designed to introduce fundamental machine learning concepts and techniques. The materials cover a range of topics from basic regression to clustering and deep learning approaches.
- Introduction_and_Regression.ipynb: Covers basic ML concepts and regression techniques
- Clustering_and_Probabilities.ipynb: Explores clustering algorithms and probability-based methods
- Comparison_of_DBSCAN_and_K_means.ipynb: Detailed comparison between two popular clustering algorithms
- Non_parametric_methods_and_deep_learning.ipynb: Introduction to non-parametric approaches and neural networks
The data/ directory contains several datasets used in the notebooks:
air_temp.csv: Temperature data for time series analysischooseK.csv: Dataset for K-means cluster analysisFMIData.csv: Finnish Meteorological Institute datamall_customers.csv: Customer segmentation datasetMushroom data.csv: Mushroom classification datasetyoutube_comments.csv: Text data for NLP demonstrations
- Python 3.x
- Jupyter Notebook
- Required libraries:
- NumPy
- Pandas
- Matplotlib
- Scikit-learn
- TensorFlow/Keras (for deep learning notebook)
- Clone this repository
- Install required dependencies
- Launch Jupyter Notebook
- Open any notebook to begin learning
For beginners, it's recommended to follow the notebooks in this order:
- Introduction_and_Regression.ipynb
- Clustering_and_Probabilities.ipynb
- Comparison_of_DBSCAN_and_K_means.ipynb
- Non_parametric_methods_and_deep_learning.ipynb