Skip to content
View mahmoudalrefaey's full-sized avatar

Block or report mahmoudalrefaey

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mahmoudalrefaey/README.md

Howdy!πŸ‘‹, I am Mahmoud Al-Refaey

Junior AI Engineer & Data Scientist

mahmoudalrefaey


πŸ‘¨β€πŸ’» About Me:

Innovative AI Engineer and Data Scientist with a B.Sc. in Artificial Intelligence (Honors) and hands-on experience in Distributed Computing and Generative AI. Proficient in the full ML lifecycleβ€”from fine-tuning LLMs (Llama 3, QLoRA) and implementing RAG pipelines to deploying production-ready applications using Django and React.

  • πŸ“‹ Resume: Check it out here
  • πŸ’¬ Ask me about: Generative AI, LLMs, RAG Pipelines, Machine Learning, Deep Learning, Data Science
  • 🧐 Interested in: NLP, Computer Vision, Distributed AI, MLOps
  • πŸ“« Reach me at: dev.mahmoudrefaey@gmail.com | +20 1026295189
  • πŸ“ Location: Cairo, Egypt

🌐 Connect with me:

LinkedIn - Abdullah Khaled Kaggle - Abdullah Khaled HackerRank - Abdullah Khaled WhatsApp - Abdullah Khaled


πŸŽ“ Education


🏫 B.Sc. in Artificial Intelligence (Honors)

  • University Badge
  • Grade Badge
  • Duration Badge
  • πŸ“š Key Courses: Machine Learning, Deep Learning, NLP, Computer Vision, Reinforcement Learning, Database Systems, Data Structures & Algorithms, OOP
  • πŸ§ͺ Graduation Project: AI-Powered Churn Prediction Platform for Telecom Companies

πŸ… Certifications & Courses

Certification Provider
πŸ“œ Machine Learning Specialization DeepLearning.AI & Stanford University
πŸ“œ IBM AI & Data Science Digital Egypt Pioneers Initiative (DEPI)
πŸ“œ Deep Learning with PyTorch Mahara Tech (ITI)
πŸ“œ Developing Applications with LangChain/LangGraph DataCamp
πŸ“œ AWS Cloud Practitioner Essentials Amazon Web Services
πŸ“œ ML & Deep Learning Training Course Zewail City of Science, Technology and Innovation


πŸ’Ό Experience


πŸ§‘β€πŸ’» AI Engineer

Beetleware Β· Remote Internship
Aug 2025 – Jan 2026

  • Built distributed AI models using the Ray framework and contributed to open-source projects.
  • Developed and deployed full-stack SaaS features with modern frameworks, APIs, and CI/CD.
  • Collaborated on real company projects in Agile teams, delivering production-ready solutions.

Ray Distributed AI CI/CD Open Source


πŸ§‘β€πŸ’» Data Science Trainee

Digital Egypt Pioneers Initiative (DEPI) Β· Hybrid Traineeship
Oct 2024 – May 2025

  • Built and deployed machine learning models using Python, Pandas, scikit-learn, and TensorFlow.
  • Collaborated in team projects simulating industry challenges, including data preprocessing and model evaluation.
  • Mentored and supported peers, helping them master core concepts in machine learning and data analysis.

Python TensorFlow Scikit-Learn



πŸ› οΈ Technical Skills


πŸ€– Generative AI & NLP

LLMs RAG Fine-tuning LangChain LangGraph FAISS Hugging Face Prompt Engineering


🧠 Machine Learning & Deep Learning

PyTorch TensorFlow Scikit-Learn Computer Vision Ensemble Methods


πŸ“Š Data Science & Visualization

Pandas NumPy Matplotlib Seaborn Plotly EDA Streamlit


πŸ–₯️ Programming Languages

Python TypeScript JavaScript SQL


🌐 Full-Stack Development

Django React REST APIs PostgreSQL MySQL


☁️ MLOps & Cloud Engineering

Docker CI/CD AWS Git GitHub


πŸ“ Mathematics & Problem Solving

Probability Statistics Calculus Analytical Thinking



πŸ“š Technical Publications & Research


πŸ“„ Graduation Project Documentation

"Customer Churn in Telecom: Predictive Modeling for Enhanced Retention Strategies" | Egyptian Russian University

  • Engineered a stacking ensemble (XGB-LGBM-RF) with 91% CV accuracy, deployed via a Django-React full-stack predictive platform.

Documentation


πŸ“„ Research Paper & Technical Documentation

"A Study of Generative Approaches for Balancing Imbalanced Data: SMOTE, GANs, and LLMs" | ResearchGate

  • Comparative study of SMOTE, GANs (CTGAN/TVAE), and LLMs for fraud detection, concluding traditional sampling offers superior stability over generative models.

ResearchGate


🌟 Soft Skills


🎯 Problem-Solving & Critical Thinking

  • Proven ability to analyze complex problems and devise data-driven solutions.
  • Highly adept at evaluating models and making decisions based on analytical insights.

🀝 Teamwork & Collaboration

  • Experience working in interdisciplinary teams to develop AI solutions.
  • Strong communication skills with both technical and non-technical stakeholders.

🧠 Creativity & Innovation

  • Passionate about solving unconventional challenges through innovative use of AI and data.

πŸ—£οΈ Communication

  • Ability to explain technical concepts clearly to non-technical stakeholders.

πŸ‘¨β€πŸ« Leadership

  • Led group projects and mentored junior team members in machine learning methodologies.

⏳ Time Management

  • Efficiently managed overlapping academic and internship responsibilities while delivering high-quality work.

πŸ” Attention to Detail

  • Rigorous in model tuning, ensuring precision and accuracy in results.

πŸ”₯ Self-Motivation

  • Consistently driven to improve technical skills and apply them in real-world scenarios.

🌍 Languages

  • πŸ‡ͺπŸ‡¬ Arabic: Native
  • πŸ‡¬πŸ‡§ English: Fluent

πŸ”¬ Featured Projects


πŸ–₯️ NexaOS – AI-Powered Desktop Environment

AI Engineer – Full-Stack Developer | React 19, TypeScript, Vite, Python, LLMs

  • Developed an AI-powered document management ecosystem featuring intelligent document, data, and PDF editors with automated content generation, custom file formats (.nd, .np, .ndf), and robust file-storage architecture.
  • Enhanced communication by connecting AI services with real-time chat (Socket.io) and internal email modulesβ€”providing smart, context-aware collaboration.

GitHub Live Demo


πŸ“„ Interactive Multi-PDF Chat and Semantic Search with Local AI

AI Engineer | Python, RAG, TinyLlama, DialoGPT, Phi-2

  • Engineered a modular RAG pipeline using Streamlit and FAISS for real-time semantic search and Q&A over multiple PDF documents.
  • Integrated local, open-source LLMs (TinyLlama, Phi-2) and Sentence-Transformers to enable private, cost-effective document intelligence.

GitHub Live Demo


πŸ“Š AI-Powered Churn Prediction Platform for Telecom Companies

Data Scientist – Backend – Team Leader | Python, Django, Machine Learning

  • Developed a high-performance stacking ensemble (XGBoost, LightGBM, RF) achieving 91% CV accuracy and 84.5% ROC-AUC for telecom retention.
  • Architected a multi-tenant Django REST backend with JWT authentication and real-time prediction APIs featuring probability scoring.
  • Led the end-to-end development life cycle as Team Leader, from technical documentation to full-stack deployment.

GitHub Live Demo


πŸ”— Additional Projects

Project Links
πŸ¦™ Fine-Tuning LLaMA3 for Coding Tasks HuggingFace Inference
⚑ Energy Consumption Forecasting GitHub Live App
πŸ• Food Classification ViT Model HuggingFace Live App
πŸ›°οΈ Land Cover Classification with ResNet50 (EuroSAT) GitHub Demo

Pinned Loading

  1. streamlit-cancer-diagnosis streamlit-cancer-diagnosis Public

    AI-powered app designed to enhance breast cancer diagnosis by connecting to your cytology lab. Utilizing a machine learning model, it predicts whether a breast mass is benign or malignant based on …

    Python

  2. Arabic-Abusive-Text-Detection Arabic-Abusive-Text-Detection Public

    Leverage advanced NLP and ML to detect abusive language in Arabic content. Contribute to fostering a safer online space. #NLP #ArabicTextAnalysis #AbuseDetection

    Jupyter Notebook