By Lance Jepsen & ChatGPT
Lance Jepsen
Data Science ยท Machine Learning ยท AutoML Systems
๐ LinkedIn: https://www.linkedin.com/in/lance-jepsen/
๐ฅ Project walkthrough: https://www.youtube.com/watch?v=vGuiYdUlMI8
ModelArena (modelarena-automl) is a competitive AutoML system with conformal prediction and uncertainty-aware modeling.
ModelArena is a professional AutoML playground where machine learning models compete, uncertainty is quantified, and predictions become explainable.
ModelArena helps you compare models, understand what drives predictions, and make decisions with uncertainty-aware outputs (locally adaptive conformal prediction intervals for regression).
Itโs built to be:
- Beginner-friendly (learn ML by doing)
- Professional-grade (tournament leaderboard + diagnostics + what-if)
- Practical (works on your own CSVs in minutes)
- Python 3.10+ recommended
- Windows / macOS / Linux
Open a terminal in the project folder:
# (optional) create & activate a virtual environment
python -m venv .venv
# Windows (PowerShell)
.venv\Scripts\Activate.ps1
# Windows (cmd)
.venv\Scripts\activate.bat
# macOS/Linux
source .venv/bin/activate
# install dependencies
python -m pip install --upgrade pip
pip install -r requirements.txtstreamlit run app.pyStreamlit will print a local URL (usually http://localhost:8501). Open it in your browser.
Goal: Predict monthly rent (a number)
Target column
target_monthly_rent_usd
Features
unit_size_sqftbedroomsbathroomsyear_builtdistance_to_downtown_milescrime_indexschool_rating
Goal: Predict tenant renewal (yes/no)
Target column
renewed_lease (0 = No, 1 = Yes)
Features
monthly_rentincome_usdtenure_monthslate_paymentsmaintenance_requestsunit_size_sqftsatisfaction_score
Upload one of the sample CSVs (or your own).
Tip: The target is the column you want to predict.
- Regression target example:
target_monthly_rent_usd - Classification target example:
renewed_lease
ModelArena works for both:
| Task | Predicts | Examples |
|---|---|---|
| Regression | a number | rent, price, time, cost |
| Classification | a category | renewal, churn, fraud |
- RMSE: penalizes large errors more
- MAE: average absolute error, easy to interpret
- Accuracy: % correct (simple baseline)
- F1: better when classes are imbalanced
- ROC-AUC: ranking quality (requires probabilities)
Important: If you choose ROC-AUC, ModelArena will only use models that can produce probabilities.
Click Run Tournament.
ModelArena will:
- train multiple models
- tune them (if tuning is enabled)
- rank them on your chosen metric
- crown a winner
Youโll see a leaderboard with the scores.
ModelArena ranks columns by permutation importance (model-agnostic):
- Higher = more predictive of the outcome
- Works for regression and classification
This is the โWhich columns matter most?โ chart.
- Points: predictions vs true values
- Diagonal line: perfect predictions
- Uncertainty band: prediction interval summary (adaptive conformal PI)
Shows:
- True positives / negatives
- False positives / negatives
This helps you see what kind of mistakes the model is making.
ModelArena provides locally adaptive conformal prediction intervals:
- Distribution-free (doesnโt assume normality)
- Works with any winning model
- Interval width adjusts by row (heteroskedastic)
Instead of only:
Predicted rent = $2,100
You also get:
95% interval โ [$1,920, $2,280]
After the tournament:
- Enter feature values
- Get an instant prediction
- See uncertainty (regression) or class outcome (classification)
Move sliders to answer:
- โWhat if unit size increases?โ
- โWhat if income drops?โ
- โWhat if crime index improves?โ
Predictions update live to make ML intuitive.
- Linear Regression / Logistic Regression
- Random Forest
- ExtraTrees
- HistGradientBoosting
- XGBoost
- LightGBM
- CatBoost
If streamlit run app.py fails, reinstall:
pip install -r requirements.txtIf you switch datasets and see odd UI behavior, refresh the page to clear Streamlit state (or use the appโs reset button if present).
Lance Jepsen โ product vision, architecture, ML direction
ChatGPT โ co-developer, ML engineering, education & documentation
MIT License โ free to use, modify, and learn from.