🛡️QuishGuard: Advanced QR Phishing (Quishing) Detection

QuishGuard is a high-performance machine learning pipeline designed to detect malicious URLs embedded in QR codes. By leveraging an XGBoost classifier trained on over 2.2 million samples, it identifies phishing attempts with a precision-first approach to minimize False Positives in security environments.

🚀 Key Features

Advanced QR-Email Parsing

Stealth Extraction: Designed to parse complex .eml and multi-part MIME email files to extract all hidden "highly evasive" Qrs.

High-Fidelity Threat Intelligence

Fresh Dataset: Utilizes a curated live-stream of malicious URLs from top-tier threat intel sources (PhishTank, OpenPhish, phiusiil, mendeley and URLhaus).

Massive Scale: Trained on a robust corpus of 2.1M+ Benign URLs and 135k+ Fresh Malicious URLs, ensuring the model recognizes current-day attack patterns.

Deep Feature Engineering

Lexicographical Profiling: Transforms raw URLs into a multi-dimensional feature vector. This stage involves handling missing or inconsistent data, tokenization, and extraction of relevant features from URLs, such as domain names, subdomains, and URL lengths

Production-Grade Performance

Precision Focused: Optimized to maintain a near-zero False Positive Rate (FPR), critical for reducing "security fatigue" in SOC environments.

Fast Inference and integration: Optimized for low-latency, real-time URL classification and seamless integration into existing security pipelines

System Architecture

Ingestion: Receives .eml files via the Flask API.
Extraction: Scans body and attachments for QR code objects.
Transformation: Converts the extracted URL into 15+ numerical features (length, special character ratios, tld, etc.).
Classification: The XGBoost engine generates a safety verdict.
Response: Returns a JSON report with a safety verdict and other details about the email.

API Usage

QuishGuard provides a lightweight Flask API for seamless integration with existing SOAR or SIEM platforms.

Scan Email File

POST /submit

Request Body

file (binary): The .eml file to be analyzed.

Python Integration Example

import requests

# Load your suspicious email file
with open("suspicious_email.eml", "rb") as f:
    files = {"file": f}
    response = requests.post("http://127.0.0.1:5001/submit", files=files)
    
print(response.json())

Sample Response

{
  "Email status": "Rejected",
  "fragments": [],
  "https://split-flexbox.com": "malicious",
  "metadata": {
    "domain": "test.com",
    "sender": "test@test.com",
    "sender_ip": "Unknown",
    "subject": "Split QR - Flexbox"
  }
}

📦 Installation & Setup

Prerequisites

Python 3.10+
Google Chrome / Chromium: Required by the html2image dependency to properly render and process HTML fragments. Ensure Chrome is installed on your system before proceeding.

git clone https://github.com/Tenzzzzzz/QuishGuard.git
cd QuishGuard
python -m venv .venv
# Windows
.venv\Scripts\activate.bat
# Linux/Mac
source .venv/bin/activate
cd Requirements
pip install -r requirements.txt
cd ..
python app.py

or if you want to reproduce from the beginning

git clone https://github.com/Tenzzzzzz/QuishGuard.git
cd QuishGuard
python -m venv .venv
# Windows
.venv\Scripts\activate.bat
# Linux/Mac
source .venv/bin/activate
cd Requirements
pip install -r requirements.txt
cd ..

Then execute the code in the Jupyter notebook

python feature_extraction.py
python the_model.py
python app.py

Future Roadmap

Evasion Detection: Implement detection for more advanced "broken" or "obfuscated" QR techniques.
Zero-Trust Layer: Add an additional heuristic layer to reduce False Positives for known corporate domains further.

Contributing

Contributions are welcome! Please open an issue or submit a pull request for any feature additions or bug fixes.

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
Data_Loading_And_Processing		Data_Loading_And_Processing
Requirements		Requirements
dataset		dataset
model		model
.gitattributes		.gitattributes
.gitignore		.gitignore
Parse_And_Extract.py		Parse_And_Extract.py
README.md		README.md
app.py		app.py
client.py		client.py
feature_extraction.py		feature_extraction.py
model_scan.py		model_scan.py
the_model.py		the_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🛡️QuishGuard: Advanced QR Phishing (Quishing) Detection

🚀 Key Features

Advanced QR-Email Parsing

High-Fidelity Threat Intelligence

Deep Feature Engineering

Production-Grade Performance

System Architecture

API Usage

Scan Email File

Request Body

Python Integration Example

📦 Installation & Setup

Prerequisites

Future Roadmap

Contributing

About

Uh oh!

Languages

Tenzzzzzz/QuishGuard

Folders and files

Latest commit

History

Repository files navigation

🛡️QuishGuard: Advanced QR Phishing (Quishing) Detection

🚀 Key Features

Advanced QR-Email Parsing

High-Fidelity Threat Intelligence

Deep Feature Engineering

Production-Grade Performance

System Architecture

API Usage

Scan Email File

Request Body

Python Integration Example

📦 Installation & Setup

Prerequisites

Future Roadmap

Contributing

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Languages