Welcome to the Data Warehouse and Analytics Project repository! 🚀
This project presents a complete end-to-end data warehousing and analytics solution — from building a modern data warehouse to generating actionable business insights.
Designed as a portfolio project, it showcases industry best practices in:
- 🏗️ Data Engineering
- 📈 Data Analytics
- 🧠 Business Intelligence
Develop a modern data warehouse using SQL Server to consolidate sales data and support analytical reporting and informed decision-making.
- 📥 Data Sources: Import data from two source systems (ERP and CRM) provided as CSV files
- 🧹 Data Quality: Cleanse and resolve data quality issues before analysis
- 🔗 Integration: Combine both sources into a single, user-friendly analytical data model
- ⏱️ Scope: Work with the latest dataset only (no historization required)
- 📝 Documentation: Provide clear documentation for both business stakeholders and analytics teams
Develop SQL-based analytics to deliver detailed insights into:
- 👥 Customer Behavior
- 📦 Product Performance
- 📉 Sales Trends
These insights empower stakeholders with key business metrics to support strategic decision-making.
data-warehouse-project/
│
├── datasets/ # 📁 Raw datasets (ERP and CRM data)
│
├── docs/ # 📚 Project documentation and architecture
│ ├── etl.drawio # 🔄 ETL techniques and methods
│ ├── data_architecture.drawio # 🏛️ Overall data architecture
│ ├── data_catalog.md # 📖 Dataset catalog with metadata
│ ├── data_flow.drawio # 🔀 Data flow diagrams
│ ├── data_models.drawio # ⭐ Data models (star schema)
│ ├── naming-conventions.md # 🏷️ Naming standards
│
├── scripts/ # 🧩 SQL scripts for ETL and transformations
│ ├── bronze/ # 🥉 Raw data extraction and loading
│ ├── silver/ # 🥈 Data cleansing and transformation
│ ├── gold/ # 🥇 Analytical and reporting models
│
├── tests/ # 🧪 Data quality and validation tests
│
├── README.md # 📄 Project overview and instructions
├── LICENSE # ⚖️ License information
├── .gitignore # 🚫 Git ignored files
└── requirements.txt # 📦 Project dependencies
This project is licensed under the MIT License.
You are free to use, modify, and distribute this project with proper attribution.