Add archive data auto-loading on database initialization #16

Merged

cchwala merged 5 commits into main from data_archive_parser on Feb 12, 2026

Conversation


cchwala (Member) commented Feb 11, 2026

Closes #8

Summary of changes in this PR:

  • Implement parse_netcdf_archive.py to load historical CML data from NetCDF files into PostgreSQL using efficient COPY FROM operations, with configurable time range limiting (see the sketch after this list)
  • Add generate_archive.py script to create compressed archive data files (metadata + time series) for demo setup and database initialization (the file format is sketched further below)
  • Enhance the Grafana dashboard with Interval (Auto/1min/5min/15min/1h/6h/1d) and Aggregation (Mean/Raw/Min/Max/Median/StdDev) dropdown controls (needed because the longer archive means we now display far more data)
  • Refactor the dashboard queries to a UNION ALL pattern that separates the raw and aggregated data paths, with safe interval casting to support auto-scaling (related to the point above)
  • Add unit tests for archive scripts covering database truncation, file creation, and error handling (0.5s runtime)
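
The loading path is deliberately simple: flatten the NetCDF archive to long-format rows and stream them into PostgreSQL via COPY FROM STDIN. Below is a minimal sketch of that approach, assuming the NetCDF variables are named rsl/tsl with cml_id/time dimensions and the target table is cml_data — these names and the batch size are illustrative, not lifted from parse_netcdf_archive.py:

```python
import io
import os

import numpy as np
import pandas as pd
import psycopg2
import xarray as xr

# Configurable time range limiting via env var (default: 7 days).
MAX_DAYS = int(os.environ.get("ARCHIVE_MAX_DAYS", "7"))


def load_netcdf_archive(nc_path: str, dsn: str, batch_rows: int = 500_000) -> None:
    """Bulk-load a CML NetCDF archive into PostgreSQL via COPY FROM STDIN."""
    ds = xr.open_dataset(nc_path)

    # Keep only the most recent MAX_DAYS of the archive.
    t_end = ds["time"].values.max()
    ds = ds.sel(time=slice(t_end - np.timedelta64(MAX_DAYS, "D"), t_end))

    # Flatten to long format: one row per (cml_id, time) pair.
    df = ds[["rsl", "tsl"]].to_dataframe().reset_index()
    df = df[["cml_id", "time", "rsl", "tsl"]]  # match the COPY column order

    # Timestamp shifting: slide the historical window so it ends "now",
    # which makes the demo dashboard show recent-looking data.
    df["time"] = df["time"] + (pd.Timestamp.now() - df["time"].max())

    with psycopg2.connect(dsn) as conn, conn.cursor() as cur:
        # Stream batches through COPY instead of issuing row-wise INSERTs.
        for start in range(0, len(df), batch_rows):
            buf = io.StringIO()
            df.iloc[start:start + batch_rows].to_csv(buf, index=False, header=False)
            buf.seek(0)
            cur.copy_expert(
                "COPY cml_data (cml_id, time, rsl, tsl) FROM STDIN WITH (FORMAT csv)",
                buf,
            )
```

COPY avoids per-row INSERT overhead, which is what makes throughput in the ~155K rows/sec range (see the commit notes below) plausible on a local database.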

- Create archive generation script using real NetCDF data with synthetic timestamps (7 days, 1.5M rows)
- Add init script to auto-load gzip-compressed archive data on first database startup (~3 seconds)
- Include archive CSV files in repo (7.6 MB total, small enough for version control)
- Update database Dockerfile for proper init script execution order (01-init-schema.sql, 99-load-archive.sh)
- Configure docker-compose to mount archive data directory and init script
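
For the checked-in archive itself, the format is gzip-compressed CSV: one metadata file plus one time-series file. A rough sketch of writing such files, with hypothetical file names and the assumption that both tables arrive as pandas DataFrames:

```python
import gzip

import pandas as pd


def write_archive(metadata: pd.DataFrame, timeseries: pd.DataFrame, out_dir: str) -> None:
    """Write the two gzip-compressed CSV archive files (names are hypothetical)."""
    # Metadata: one row per CML (id, site coordinates, frequency, ...).
    with gzip.open(f"{out_dir}/cml_metadata.csv.gz", "wt", newline="") as f:
        metadata.to_csv(f, index=False)

    # Time series: the bulk of the ~7.6 MB archive; gzipped CSV stays small
    # enough to keep under version control.
    with gzip.open(f"{out_dir}/cml_timeseries.csv.gz", "wt", newline="") as f:
        timeseries.to_csv(f, index=False)
```

On first database startup, an init script such as 99-load-archive.sh can then decompress these files and feed them to the server (for example, gunzip -c piped into psql's \copy); the exact commands live in the script.
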
codecov bot commented Feb 11, 2026

Codecov Report

❌ Patch coverage is 89.70100% with 31 lines in your changes missing coverage. Please review.
✅ Project coverage is 68.72%. Comparing base (841b78a) to head (784536e).
⚠️ Report is 1 commit behind head on main.

Files with missing lines          Patch %   Lines
parser/parse_netcdf_archive.py    84.10%    31 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main      #16      +/-   ##
==========================================
+ Coverage   64.64%   68.72%   +4.08%     
==========================================
  Files          19       22       +3     
  Lines        1547     1848     +301     
==========================================
+ Hits         1000     1270     +270     
- Misses        547      578      +31     
Flag            Coverage Δ
mno_simulator   87.87% <100.00%> (+2.05%) ⬆️
parser          80.56% <86.75%> (+2.20%) ⬆️
webserver       29.63% <ø> (ø)

- Add parse_netcdf_archive.py for direct NetCDF-to-DB loading with PostgreSQL COPY
- Support configurable time window via ARCHIVE_MAX_DAYS env var (default: 7 days)
- Auto-download 3-month NetCDF dataset (~209 MB) on first run
- Achieve ~155K rows/sec throughput with batched processing and timestamp shifting
- Update README with dual archive loading methods (CSV default vs NetCDF high-resolution)
- Add Interval (Auto/1min/5min/15min/1h/6h/1d) and Aggregation (Mean/Raw/Min/Max/Median/StdDev) dropdown variables
- Refactor RSL and TSL queries to UNION ALL pattern separating raw and aggregated paths
- Support auto interval via $__interval_ms with safe ::interval casting outside the CASE expression (a concrete query sketch follows below)
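
To make that pattern concrete, here is a sketch of what one panel's SQL could look like, held in a Python string to match the repo's tooling. It assumes PostgreSQL 14+ (for date_bin), a cml_data table with cml_id/time/rsl columns, and dashboard variables named $aggregation and $agg_interval — the actual dashboard JSON in this PR may differ:

```python
# Hypothetical RSL panel query illustrating the raw/aggregated UNION ALL split.
# '$aggregation' and '$agg_interval' stand in for the dashboard variables;
# $__timeFilter(...) and $__interval_ms are Grafana macros expanded before the
# query reaches PostgreSQL. Only the Raw and Mean branches are shown.
RSL_PANEL_QUERY = """
SELECT time AS "time", cml_id, rsl
FROM cml_data
WHERE '$aggregation' = 'Raw' AND $__timeFilter(time)

UNION ALL

SELECT
    -- The ::interval cast is applied once, outside the CASE, so both the
    -- Auto branch (built from $__interval_ms) and the fixed choices
    -- ('1min', '5min', ...) are cast safely in one place.
    date_bin(
        (CASE WHEN '$agg_interval' = 'Auto'
              THEN $__interval_ms || ' ms'
              ELSE '$agg_interval'
         END)::interval,
        time,
        TIMESTAMP '2000-01-01'
    ) AS "time",
    cml_id,
    avg(rsl) AS rsl
FROM cml_data
WHERE '$aggregation' = 'Mean' AND $__timeFilter(time)
GROUP BY 1, 2

ORDER BY 1
"""
```

Because each SELECT filters on the current '$aggregation' value, only one arm of the UNION ALL returns rows, so Grafana receives a single coherent series either way.
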
cchwala merged commit 9f3bd73 into main Feb 12, 2026
7 checks passed
cchwala deleted the data_archive_parser branch February 12, 2026 12:20

Development

Successfully merging this pull request may close issue #8: Parse large existing open CML data to database as fast as possible.
