Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

Pavel-Suba/sql-data-warehouse-project

📊 Data Warehouse and Analytics Project

Welcome to the Data Warehouse and Analytics Project repository! 🚀
This project presents an end-to-end data warehousing and analytics solution, from building a modern data warehouse to generating actionable business insights.

Designed as a portfolio project, it showcases industry best practices in:

  • 🏗️ Data Engineering
  • 📈 Data Analytics
  • 🧠 Business Intelligence

🚀 Project Requirements

🏗️ Building the Data Warehouse (Data Engineering)

🎯 Objective

Develop a modern data warehouse using SQL Server to consolidate sales data and support analytical reporting and informed decision-making.

📋 Specifications

  • 📥 Data Sources: Import data from two source systems (ERP and CRM) provided as CSV files
  • 🧹 Data Quality: Cleanse and resolve data quality issues before analysis
  • 🔗 Integration: Combine both sources into a single, user-friendly analytical data model
  • ⏱️ Scope: Work with the latest dataset only (no historization required)
  • 📝 Documentation: Provide clear documentation for both business stakeholders and analytics teams
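
As a rough illustration of the specifications above, the sketch below loads two CSV extracts unchanged, cleanses them, and joins them into one customer view. It is not the project's actual code: SQLite (via Python's standard library) stands in for SQL Server so the example runs anywhere, and every table name, column name, and sample row is a hypothetical placeholder. The bronze, silver, and gold prefixes mirror the folder names under scripts/. On SQL Server the load step would more likely be a bulk load such as `BULK INSERT`.

```python
import csv
import io
import sqlite3

# Hypothetical CSV extracts; the real files in datasets/ may look different.
CRM_CSV = """customer_id,first_name,last_name
1, Ada ,Lovelace
2,Alan,Turing
2,Alan,Turing
"""
ERP_CSV = """customer_id,country
1,UK
2,
"""


def load_csv(conn, table, text):
    """Bronze step: load rows as-is, with every column stored as text."""
    rows = list(csv.reader(io.StringIO(text)))
    header, data = rows[0], rows[1:]
    columns = ", ".join(f"{name} TEXT" for name in header)
    conn.execute(f"CREATE TABLE {table} ({columns})")
    placeholders = ", ".join("?" for _ in header)
    conn.executemany(f"INSERT INTO {table} VALUES ({placeholders})", data)


def build_warehouse():
    conn = sqlite3.connect(":memory:")

    # Bronze: raw copies of both source systems.
    load_csv(conn, "bronze_crm_customers", CRM_CSV)
    load_csv(conn, "bronze_erp_locations", ERP_CSV)

    # Silver: trim whitespace, cast keys, drop exact duplicates, fill blanks.
    conn.execute("""
        CREATE TABLE silver_crm_customers AS
        SELECT DISTINCT CAST(customer_id AS INTEGER) AS customer_id,
               TRIM(first_name) AS first_name,
               TRIM(last_name)  AS last_name
        FROM bronze_crm_customers
    """)
    conn.execute("""
        CREATE TABLE silver_erp_locations AS
        SELECT CAST(customer_id AS INTEGER) AS customer_id,
               COALESCE(NULLIF(TRIM(country), ''), 'n/a') AS country
        FROM bronze_erp_locations
    """)

    # Gold: one integrated, analysis-friendly view across CRM and ERP.
    conn.execute("""
        CREATE VIEW gold_dim_customers AS
        SELECT c.customer_id, c.first_name, c.last_name, l.country
        FROM silver_crm_customers AS c
        LEFT JOIN silver_erp_locations AS l ON l.customer_id = c.customer_id
    """)
    return conn


conn = build_warehouse()
customers = conn.execute(
    "SELECT * FROM gold_dim_customers ORDER BY customer_id"
).fetchall()
print(customers)
# [(1, 'Ada', 'Lovelace', 'UK'), (2, 'Alan', 'Turing', 'n/a')]
```

Because only the latest dataset is in scope, each run can rebuild the tables from scratch rather than tracking history.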

📊 BI: Analytics & Reporting (Data Analysis)

🎯 Objective

Develop SQL-based analytics to deliver detailed insights into:

  • 👥 Customer Behavior
  • 📦 Product Performance
  • 📉 Sales Trends

These insights empower stakeholders with key business metrics to support strategic decision-making.
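
The three analysis areas above could each map to a simple aggregate query. The sketch below is only an assumption about what such queries might look like: it uses SQLite in place of SQL Server, and the `fact_sales` table, its columns, and the sample rows are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical sales fact table with made-up rows.
    CREATE TABLE fact_sales (
        order_date   TEXT,
        customer_id  INTEGER,
        product_name TEXT,
        sales_amount INTEGER
    );
    INSERT INTO fact_sales VALUES
        ('2024-01-15', 1, 'Helmet', 50),
        ('2024-01-20', 2, 'Bike',   900),
        ('2024-02-03', 1, 'Bike',   900),
        ('2024-02-10', 1, 'Gloves', 25);
""")

# Customer behavior: number of orders and total spend per customer.
customer_spend = conn.execute("""
    SELECT customer_id, COUNT(*) AS orders, SUM(sales_amount) AS total_spend
    FROM fact_sales
    GROUP BY customer_id
    ORDER BY total_spend DESC
""").fetchall()

# Product performance: revenue per product, best seller first.
product_revenue = conn.execute("""
    SELECT product_name, SUM(sales_amount) AS revenue
    FROM fact_sales
    GROUP BY product_name
    ORDER BY revenue DESC
""").fetchall()

# Sales trends: revenue per calendar month (YYYY-MM prefix of the date).
monthly_sales = conn.execute("""
    SELECT substr(order_date, 1, 7) AS month, SUM(sales_amount) AS revenue
    FROM fact_sales
    GROUP BY month
    ORDER BY month
""").fetchall()

print(customer_spend)   # [(1, 3, 975), (2, 1, 900)]
print(product_revenue)  # [('Bike', 1800), ('Helmet', 50), ('Gloves', 25)]
print(monthly_sales)    # [('2024-01', 950), ('2024-02', 925)]
```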


📂 Repository Structure

data-warehouse-project/
│
├── datasets/                           # 📁 Raw datasets (ERP and CRM data)
│
├── docs/                               # 📚 Project documentation and architecture
│   ├── etl.drawio                      # 🔄 ETL techniques and methods
│   ├── data_architecture.drawio        # 🏛️ Overall data architecture
│   ├── data_catalog.md                 # 📖 Dataset catalog with metadata
│   ├── data_flow.drawio                # 🔀 Data flow diagrams
│   ├── data_models.drawio              # ⭐ Data models (star schema)
│   └── naming-conventions.md           # 🏷️ Naming standards
│
├── scripts/                            # 🧩 SQL scripts for ETL and transformations
│   ├── bronze/                         # 🥉 Raw data extraction and loading
│   ├── silver/                         # 🥈 Data cleansing and transformation
│   └── gold/                           # 🥇 Analytical and reporting models
│
├── tests/                              # 🧪 Data quality and validation tests
│
├── README.md                           # 📄 Project overview and instructions
├── LICENSE                             # ⚖️ License information
├── .gitignore                          # 🚫 Git ignored files
└── requirements.txt                    # 📦 Project dependencies
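
To give a sense of what the checks in tests/ might cover, here is a minimal sketch of two common validations on a key column: duplicate values and missing values. It is an assumption, not the repository's actual test suite. SQLite stands in for SQL Server, and the `run_quality_checks` helper, the `silver_customers` table, and its rows are all hypothetical.

```python
import sqlite3


def run_quality_checks(conn, table, key):
    """Return check name -> count of problems found (0 means the check passes).

    Table and column names are interpolated directly, so pass trusted names only.
    """
    # Number of distinct key values that appear more than once.
    duplicate_keys = conn.execute(
        f"SELECT COUNT(*) FROM ("
        f"  SELECT {key} FROM {table}"
        f"  WHERE {key} IS NOT NULL"
        f"  GROUP BY {key} HAVING COUNT(*) > 1"
        f")"
    ).fetchone()[0]

    # Number of rows where the key is missing.
    null_keys = conn.execute(
        f"SELECT COUNT(*) FROM {table} WHERE {key} IS NULL"
    ).fetchone()[0]

    return {"duplicate_keys": duplicate_keys, "null_keys": null_keys}


conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical table with one duplicated key and one missing key.
    CREATE TABLE silver_customers (customer_id INTEGER, first_name TEXT);
    INSERT INTO silver_customers VALUES
        (1, 'Ada'), (2, 'Alan'), (2, 'Alan'), (NULL, 'Grace');
""")

results = run_quality_checks(conn, "silver_customers", "customer_id")
print(results)
# {'duplicate_keys': 1, 'null_keys': 1}
```

A non-zero count points to rows that need attention in the cleansing step before they reach the reporting models.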

🛡️ License

This project is licensed under the MIT License.
You are free to use, modify, and distribute this project with proper attribution.
