PhishGuard AI – Phishing Website Detection 🔒

PhishGuard AI is a machine learning project that detects phishing websites with an XGBoost classifier trained on 17 handcrafted URL, domain, and page-content features. The features are extracted from phishing URLs collected from PhishTank and benign URLs from the University of New Brunswick (UNB) dataset.

🚀 Features

17 handcrafted features grouped into 3 categories:
- Address Bar features
- Domain features
- HTML & JavaScript features
Uses an XGBoost classifier for phishing detection.
Reaches about 86.7% training accuracy and 85.8% test accuracy (see Dataset & Results).
Supports retraining and extension with new data.
Saves the trained model (models/phishguard_xgb.pkl) for reuse.
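
As an illustration of the Address Bar category, the sketch below derives a few URL-level indicators using only the Python standard library. The feature names, the 54-character length threshold, and the shortener list are assumptions made for this example; the actual 17 features are the ones stored in DataFiles/urldata.csv.

```python
import ipaddress
from urllib.parse import urlparse

# Hypothetical shortener list for illustration; not the project's actual list.
SHORTENERS = {"bit.ly", "tinyurl.com", "goo.gl", "t.co"}


def has_ip_host(url: str) -> int:
    """Return 1 if the host part of the URL is a literal IP address."""
    host = urlparse(url).hostname or ""
    try:
        ipaddress.ip_address(host)
        return 1
    except ValueError:
        return 0


def address_bar_features(url: str, max_length: int = 54) -> dict:
    """Compute a handful of binary/count indicators from the URL string."""
    parsed = urlparse(url)
    host = parsed.hostname or ""
    return {
        "have_ip": has_ip_host(url),                 # IP address instead of a domain
        "have_at": int("@" in url),                  # '@' hides the real host
        "long_url": int(len(url) >= max_length),     # assumed length threshold
        "url_depth": len([p for p in parsed.path.split("/") if p]),
        "redirection": int(url.rfind("//") > 7),     # '//' beyond the scheme
        "tiny_url": int(host in SHORTENERS),         # known shortening service
        "prefix_suffix": int("-" in host),           # '-' in the domain name
    }


if __name__ == "__main__":
    print(address_bar_features("http://192.168.0.1/a/b/login.php"))
```

Each value is an integer, so a row of such features can be written directly to a CSV column set and fed to a tree-based classifier without further encoding.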

📂 Repository Structure

PhishGuard/
├── DataFiles/
│   └── urldata.csv          # Dataset with 17 features + label
├── train.py                 # Training script (XGBoost pipeline)
├── predict.py               # Script to classify new URLs
├── models/
│   └── phishguard_xgb.pkl   # Saved trained model
├── requirements.txt         # Python dependencies
└── README.md                # Project documentation

📊 Dataset & Results

Dataset size: ~10,000 URLs (5,000 phishing + 5,000 legitimate) sourced from:
- PhishTank (phishing)
- University of New Brunswick (UNB) dataset (benign)
Split: 80% training, 20% testing.
Achieved performance (original project benchmark):
- Train Accuracy: ~86.7%
- Test Accuracy: ~85.8%
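
To make the numbers above concrete: an 80/20 split of 10,000 balanced URLs leaves 8,000 for training and 2,000 for testing, so a test accuracy of about 85.8% corresponds to roughly 1,716 correct predictions out of 2,000. The standard-library sketch below reproduces that arithmetic. It assumes the split is stratified by class, which the README does not state, and the helper names are hypothetical.

```python
import random


def stratified_split(labels, test_fraction=0.2, seed=0):
    """Return (train_idx, test_idx) with the same class ratio in both parts."""
    rng = random.Random(seed)
    by_class = {}
    for i, label in enumerate(labels):
        by_class.setdefault(label, []).append(i)
    train_idx, test_idx = [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_test = round(len(indices) * test_fraction)
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return train_idx, test_idx


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# 5,000 phishing (1) and 5,000 legitimate (0) labels, as in the dataset summary.
labels = [1] * 5000 + [0] * 5000
train_idx, test_idx = stratified_split(labels)
print(len(train_idx), len(test_idx))  # 8000 2000

# A classifier that gets 1,716 of the 2,000 test URLs right scores 0.858.
y_true = [labels[i] for i in test_idx]
y_pred = list(y_true)
for k in range(284):          # flip 284 predictions to simulate errors
    y_pred[k] = 1 - y_pred[k]
print(accuracy(y_true, y_pred))  # 0.858
```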

📌 Future Directions

Add deep learning baselines (LSTM, Transformers on raw URLs).
Integrate real-time URL scanning.
Deploy as a browser plugin or REST API for live phishing protection.

Repository: krishdef7/PhishGuard-AI