AI Engineer · MPhil · Lahore, Pakistan

Taha Tanvir

—

Building ML pipelines, deep learning models, and production-ready AI systems — from research prototype to deployed application.

GitHub LinkedIn Email Me

About

Building intelligence
from research to reality.

AI Engineer with a full-stack background, completing an MPhil in Artificial Intelligence at PUCIT. Experienced in building ML pipelines, deep learning models, LLMs, and production-ready AI applications from research to deployment.

Alongside the MPhil, I ship projects spanning generative models, retrieval-augmented generation, and multimodal deep learning.

My background bridges rigorous research and production engineering, from fine-tuning a 6.6B-parameter diffusion model (SDXL) to deploying hybrid RAG pipelines with Docker and FastAPI.

LLMs · RAG Systems · Diffusion Models · Transfer Learning · FastAPI · PyTorch

Degrees

BS + MPhil AI

Projects

Research to prod

Publication

Peer-reviewed paper

Based in

Lahore, Punjab, Pakistan

Open to remote opportunities

Skills

Technical Expertise

A breadth of tools across the full AI/ML stack — from model training to production deployment.

AI & Machine Learning

PyTorch · Scikit-learn · HuggingFace · LSTM · GRU · CNN · Transformers · ViT · Swin Transformer · Deep Learning

RAG & Information Retrieval

LangChain · Pinecone · FAISS · BM25 · Cohere Reranking · Ragas · TF-IDF

Generative AI

Diffusers · DreamBooth · LoRA · SDXL · CLIP · Stable Diffusion

Backend & Infrastructure

Docker · FastAPI · Flask · Firebase · PostgreSQL · MongoDB · REST APIs

Programming Languages

Python · JavaScript · C++ · HTML · CSS

Web & Mobile

React.js · Node.js · Express.js · Flutter

Projects

Selected Work

Research-driven engineering — production AI systems with measurable outcomes.

RAG

2026

RAG Knowledge Base System

Hybrid RAG pipeline combining dense vector search (HuggingFace + Pinecone) with BM25 sparse retrieval, Cohere Reranking, and Gemini generation. Plug-and-play retriever/LLM architecture deployed via FastAPI, Docker, and Streamlit.

◆ 1.0 Ragas faithfulness score

◆ Hybrid dense + sparse retrieval

◆ Plug-and-play architecture

Python · LangChain · Pinecone · Cohere · FastAPI · Docker · Streamlit

View on GitHub
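The project description does not say how the dense and sparse result lists are merged before reranking. One common option is reciprocal rank fusion (RRF), sketched below as an illustration only. The function name, the document IDs, and the constant `k=60` are assumptions for the example, not details taken from the project.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. k=60 is a commonly used default (assumed here).
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical outputs of a dense (vector) retriever and a BM25 retriever.
dense_hits = ["doc_a", "doc_b", "doc_c"]
sparse_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([dense_hits, sparse_hits])
print(fused)
```

Documents found by both retrievers rise to the top, and the fused list could then be passed to a reranker. RRF needs only ranks, so it avoids having to normalize cosine similarities against BM25 scores.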

Generative AI

2026

DreamBooth LoRA — Few-Shot Subject Generation on SDXL

Fine-tuned SDXL (6.6B params) with rank-32 LoRA from only 5 photos per subject on a 15 GB GPU, using 13 memory optimization techniques. Full pipeline with rembg background removal and prior preservation.

◆ 70.88% CLIP-I fidelity

◆ vs 48.20% SD 1.5 baseline

◆ +47% relative improvement

PyTorch · Diffusers · CLIP · SDXL · LoRA · rembg

View on GitHub
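The "+47% relative improvement" figure follows from the two CLIP-I scores listed above. A short check of the arithmetic, using only those two numbers (the helper name is illustrative):

```python
def relative_improvement(new, baseline):
    """Return the gain of `new` over `baseline` as a percentage of `baseline`."""
    return (new - baseline) / baseline * 100.0


sdxl_clip_i = 70.88  # CLIP-I fidelity reported for the SDXL LoRA model (%)
sd15_clip_i = 48.20  # CLIP-I fidelity reported for the SD 1.5 baseline (%)

absolute_gain = sdxl_clip_i - sd15_clip_i  # difference in percentage points
relative_gain = relative_improvement(sdxl_clip_i, sd15_clip_i)

print(f"absolute gain: {absolute_gain:.2f} points")
print(f"relative gain: {relative_gain:.1f}%")
```

The absolute gain is 22.68 percentage points, and dividing it by the 48.20% baseline gives roughly 47%.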

ML Research

2025

CIFAR-100 Architecture Benchmark

Comprehensive benchmark of ResNet50, ViT-B/16, and Swin Transformer Tiny on CIFAR-100 with ImageNet pre-trained weights via transfer learning. Analyzed convergence speed, memory footprint, and generalization tradeoffs.

◆ ResNet50: 82.22%

◆ ViT-B/16: 87.73%

◆ Swin Tiny: 87.23%

Python · PyTorch · HuggingFace · ViT-B/16 · Swin Transformer

View on GitHub

Research ✦ Published

2025

Multimodal Human Activity Recognition

Comparison of early and late fusion for 12-class activity recognition across smartphone, smartwatch, and smart glasses sensor streams, evaluated with leave-one-subject-out (LOSO) validation. Findings published as a formal research paper.

◆ 55.18% subject-independent accuracy

◆ LOSO validation

◆ Published research paper

Python · PyTorch · CNN-LSTM · CogAge

View on GitHub
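LOSO validation holds out every sample from one subject per fold, so the reported accuracy reflects people the model has never seen. The sketch below shows the splitting logic in plain Python. The function name and the subject labels are hypothetical and are not taken from the project code.

```python
def loso_splits(subject_ids):
    """Yield (held_out_subject, train_indices, test_indices), one fold per subject."""
    for held_out in sorted(set(subject_ids)):
        train_idx = [i for i, s in enumerate(subject_ids) if s != held_out]
        test_idx = [i for i, s in enumerate(subject_ids) if s == held_out]
        yield held_out, train_idx, test_idx


# Hypothetical subject label for each recorded sample window.
subjects = ["s1", "s1", "s2", "s3", "s3", "s2"]

folds = list(loso_splits(subjects))
for held_out, train_idx, test_idx in folds:
    print(held_out, "train:", train_idx, "test:", test_idx)
```

No subject appears in both the training and test indices of a fold, which is what makes the resulting score subject-independent.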

FYP

2025

AI Voice Cloning Application

Final Year Project integrating pre-trained deep learning TTS models through a Flask backend for high-fidelity voice cloning from reference audio. Includes a cross-platform Flutter mobile app with Firebase auth, audio storage, and real-time sync.

◆ Cross-platform mobile app

◆ Firebase real-time sync

◆ High-fidelity voice cloning

Python · Flask · Deep Learning · TTS · Flutter · Firebase

View on GitHub

Journey

Experience & Education

Academic research and professional engineering — from Lahore to Switzerland.

◎ Education · 2025 – Present

MPhil in Artificial Intelligence

University of the Punjab — PUCIT · Lahore, Pakistan

Postgraduate research in AI, specializing in advanced machine learning systems and production AI deployment pipelines.

✦ Certification · December 2025

IBM Full Stack Software Developer Professional Certificate

Coursera / IBM · Remote

Issued by IBM, verified on Credly.

◈ Experience · 2023 – 2024

Freelance Full Stack Developer

Fiverr · Remote

Built a full-stack data management system for a Switzerland-based client using the MERN stack, from database architecture to React frontend. Managed end-to-end deployment, including domain configuration, cloud hosting, and REST API integration for secure data handling.

◎ Education · 2021 – 2025

Bachelor of Science in Software Engineering

University of Lahore (UOL) · Lahore, Pakistan

Completed BS in Software Engineering. Final Year Project: AI Voice Cloning application using deep learning TTS models and Flutter.

Contact

Let's build
something together.

Whether it's a production AI system, a research collaboration, or a freelance project — reach out and let's talk.

Email

tahatanvir605@gmail.com

Say hello or discuss a project

Open →

LinkedIn

tahatanvir

Connect professionally

Open →

GitHub

TahaUser5

Browse projects and code

Open →

Taha Tanvir · 2026 · Lahore, Pakistan
