Hi, I'm Hafsa El-Mahdi — focused on building robust Big Data pipelines, distributed systems & AI solutions using modern data technologies.
I'm Hafsa El-Mahdi, a Data Engineering student at ENSAH, Morocco.
Passionate about building scalable data pipelines and exploring distributed systems.
End-to-end Lambda Architecture pipeline. Ingestion via HDFS, federated querying with Trino/Presto, orchestration via Airflow, real-time dashboarding with Streamlit.
RAG-based chatbot using local LLMs via Ollama. Processes university documents for contextual Q&A.
NoSQL sharding & replication architecture for horizontal scaling and high availability.
Automated extraction, cleaning & visualization pipeline with BeautifulSoup and Scrapy.
Movie rating prediction, Iris classification, and sales forecasting with scikit-learn.
Console-based C application with file I/O, data structures, and interactive CLI.
Udemy — Digital Innovation | Les Experts
November 2024
AJI T3LM Programming
December 2024
Udemy — Mikail Altundas
February 2025
Hult Prize at ENSAH Al-Hoceima
February 2023
Open to internships, collaborations & interesting data challenges.
Let's collaborate on data pipelines, ML systems, or AI-driven products.
Download CV