AI Engineer · MPhil · Lahore, Pakistan

Taha Tanvir

Building ML pipelines, deep learning models, and production-ready AI systems — from research prototype to deployed application.


Building intelligence
from research to reality.

AI Engineer with a full-stack background, completing an MPhil in Artificial Intelligence at PUCIT. Experienced in building ML pipelines, deep learning models, LLMs, and production-ready AI applications from research to deployment.

Currently pursuing my MPhil in AI at PUCIT while shipping cutting-edge projects spanning generative models, retrieval-augmented generation, and multimodal deep learning.

My background bridges rigorous research and production engineering — from fine-tuning 6B-parameter diffusion models to deploying hybrid RAG pipelines with Docker and FastAPI.

LLMsRAG SystemsDiffusion ModelsTransfer LearningFastAPIPyTorch
0

Degrees

BS + MPhil AI

0

Projects

Research to prod

0

Publication

Peer-reviewed paper

Based in

Lahore, Punjab, Pakistan

Open to remote opportunities


Technical Expertise

A breadth of tools across the full AI/ML stack — from model training to production deployment.

AI & Machine Learning

PyTorchScikit-learnHuggingFaceLSTMGRUCNNTransformersViTSwin TransformerDeep Learning

RAG & Information Retrieval

LangChainPineconeFAISSBM25Cohere RerankingRagasTF-IDF

Generative AI

DiffusersDreamBoothLoRASDXLCLIPStable Diffusion

Backend & Infrastructure

DockerFastAPIFlaskFirebasePostgreSQLMongoDBREST APIs

Programming Languages

PythonJavaScriptC++HTMLCSS

Web & Mobile

React.jsNode.jsExpress.jsFlutter

Selected Work

Research-driven engineering — production AI systems with measurable outcomes.

RAG
2026

RAG Knowledge Base System

Hybrid RAG pipeline combining dense vector search (HuggingFace + Pinecone) with BM25 sparse retrieval, Cohere Reranking, and Gemini generation. Plug-and-play retriever/LLM architecture deployed via FastAPI, Docker, and Streamlit.

1.0 Ragas faithfulness score
Hybrid dense + sparse retrieval
Plug-and-play architecture
PythonLangChainPineconeCohereFastAPIDockerStreamlit
Generative AI
2026

DreamBooth LoRA — Few-Shot Subject Generation on SDXL

Fine-tuned SDXL (6.6B params) with LoRA Rank-32 on only 5 photos per subject on a 15GB GPU using 13 memory optimization techniques. Full rembg background removal + prior preservation pipeline.

70.88% CLIP-I fidelity
vs 48.20% SD 1.5 baseline
+47% relative improvement
PyTorchDiffusersCLIPSDXLLoRArembg
ML Research
2025

CIFAR-100 Architecture Benchmark

Comprehensive benchmark of ResNet50, ViT-B/16, and Swin Transformer Tiny on CIFAR-100 with ImageNet pre-trained weights via transfer learning. Analyzed convergence speed, memory footprint, and generalization tradeoffs.

ResNet50: 82.22%
ViT-B/16: 87.73%
Swin Tiny: 87.23%
PythonPyTorchHuggingFaceViT-B/16Swin Transformer
Research✦ Published
2025

Multimodal Human Activity Recognition

Early vs Late Fusion comparison for 12-class activity recognition across smartphone, smartwatch, and smart glasses sensor streams using LOSO validation. Findings published as a formal research paper.

55.18% subject-independent accuracy
LOSO validation
Published research paper
PythonPyTorchCNN-LSTMCogAge
FYP
2025

AI Voice Cloning Application

Final Year Project integrating pre-trained TTS deep learning models via Flask backend for high-fidelity voice cloning from reference audio. Cross-platform Flutter mobile app with Firebase auth, audio storage, and real-time sync.

Cross-platform mobile app
Firebase real-time sync
High-fidelity voice cloning
PythonFlaskDeep LearningTTSFlutterFirebase

Experience & Education

Academic research and professional engineering — from Lahore to Switzerland.

Education2025 — Present

MPhil in Artificial Intelligence

University of the Punjab — PUCIT · Lahore, Pakistan

Postgraduate research in AI, specializing in advanced machine learning systems and production AI deployment pipelines.

CertificationDecember 2025

IBM Full Stack Software Developer Professional Certificate

Coursera / IBM · Remote

Issued by IBM, verified on Credly.

Experience2023 — 2024

Freelance Full Stack Developer

Fiverr · Remote

Built a full-stack data management system for a Switzerland-based client using MERN stack — from database architecture to React frontend. Managed end-to-end deployment including domain config, cloud hosting, and REST API integration for secure data handling.

Education2021 — 2025

Bachelor of Science in Software Engineering

University of Lahore (UOL) · Lahore, Pakistan

Completed BS in Software Engineering. Final Year Project: AI Voice Cloning application using deep learning TTS models and Flutter.


Let's build
something together.

Whether it's a production AI system, a research collaboration, or a freelance project — reach out and let's talk.

Taha Tanvir · 2026 · Lahore, Pakistan