This repository contains the code that accompanies the paper
Knowledge-Guided Machine Learning Models to Upscale Evapotranspiration in the U.S. Midwest.
Aleksei Rozanov, Samikshya Subedi, Vasudha Sharma, Bryan Runck
Knowledge-Guided Machine Learning Models to Upscale Evapotranspiration in the U.S. Midwest.
Aleksei Rozanov, Samikshya Subedi, Vasudha Sharma, Bryan Runck
| Resource | DOI |
|---|---|
| Paper | (under review) |
| Dataset | 10.13020/37XE-QQ18 |
| Repository (code) |
ET_LCCMR/
├─ LICENSE
├─ README.md
├─ requirements.txt # Core deps for MSI usage
├─ requirements_local.txt # Extras for local/dev use
├─ .gitignore
├─ paper/ # Drafts, manuscript assets
├─ fig/ # Exported figures
├─ models/
│ └─ ligthgbm_model.txt
├─ preprocessing/ # One-off data prep notebooks (site/stack/mesonet)
│ ├─ 1. Carbon_Data.ipynb
│ ├─ 2. Site_Selection.ipynb
│ ├─ 3. Stack_Data.ipynb
│ └─ 4. Mesonet_ValidationData.ipynb
├─ notebooks/
│ ├─ src/
│ │ ├─ PM_eq.py # PM helpers (LE↔ET conversion, etc.)
│ │ ├─ train_ann.py # Simple ANN training
│ │ └─ train.py # Tree-models training / utils
│ ├─ 1. Final_Model_Training_and_Validation.ipynb
│ └─ 2. Model_Mesonet_Testing.ipynb
└─ pipeline/ # Data pipelines (ERA, MODIS) + utilities
├─ src/
│ ├─ config.py # Centralized config
│ ├─ PM_eq.py # PM helpers (duplicated with notebooks/src/)
│ ├─ utils_era.py # ERA5-Land acquisition and transforms
│ └─ utils_modis.py # MODIS acquisition and transforms
├─ driveDerive.py
├─ 1. ERA_Pipeline.ipynb
├─ 2. MODIS_Pipeline.ipynb
└─ 3. DataStacking.ipynb
To replicate the analysis presented in this repository, follow these steps:
First, clone the repository to your local machine:
git clone https://github.com/RTGS-Lab/ET_LCCMR.git
cd ET_LCCMR
It is highly recommended to use a virtual environment to manage dependencies.
python -m venv .venv
source .venv/bin/activate # On Windows, use .venv\Scripts\activate
pip install -r requirements_local.txt # Or requirements.txt to run on MSI
Navigate to the preprocessing/ directory and run the Jupyter notebooks in the specified order to prepare the data for model training:
1. Carbon_Data.ipynb
2. Site_Selection.ipynb
3. Stack_Data.ipynb
4. Mesonet_Validation_Data.ipynb