Skip to content
View EnggTalha's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report EnggTalha

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
EnggTalha/README.md
Typing SVG

LinkedIn Kaggle Email Profile Views


πŸš€ About Me

I'm an AI/ML Engineer with 3+ years of experience building and shipping production-grade LLM systems at scale. I specialize in RAG architectures, Voice AI, distributed inference, and real-time conversational agents β€” turning cutting-edge research into systems that actually work in the real world.

  • πŸ—οΈ Architected distributed LLM inference platforms serving 1M+ daily queries on AWS EKS
  • πŸŽ™οΈ Built multilingual Voice AI agents using LiveKit, WebRTC, Twilio, OpenAI & Deepgram β€” automating 50K+ calls/month
  • ⚑ Reduced LLM inference latency 50% (400ms β†’ 200ms p95) and costs 30% via GPTQ quantization & vLLM
  • πŸ“‰ Cut hallucinations 40% using hybrid Graph + Vector RAG on enterprise knowledge bases (10M+ docs)
  • 🌍 Published low-resource NLP research for Urdu (70M+ speakers) β€” targeting ACL/EMNLP 2025

πŸ› οΈ Tech Stack

🧠 LLM & AI Frameworks

LangChain LangGraph vLLM CrewAI OpenAI SDK HuggingFace

πŸŽ™οΈ Voice AI & Real-Time

LiveKit WebRTC Twilio Deepgram Whisper

πŸ€– Models

GPT-4 Claude Gemini LLaMA 3 Mistral Qwen 2.5

πŸ—„οΈ Vector Databases

Pinecone Weaviate Qdrant ChromaDB pgvector FAISS

☁️ Cloud & Infrastructure

AWS Azure Kubernetes Docker

βš™οΈ MLOps & Data

PyTorch FastAPI Apache Spark Kafka MLflow n8n


πŸš€ Featured Projects

Project Stack Highlights
AI Calling Agents LiveKit, WebRTC, Twilio, OpenAI, Deepgram 100K+ users, 95% automation, 50+ business customers
Call Analytics SaaS FastAPI, LLM, Dashboards Real-time sentiment, summaries & conversation analytics
Dental AI Bot RAG, WhatsApp/Instagram/Twitter Domain-specific omnichannel chatbot
Restaurant AI Bot Multimodal RAG, Pinecone, LangGraph Multi-platform reservations, menu image understanding
ICAP AI Bot LLaMA 3.1, RAG, Ubuntu On-Prem On-premises deployment, enhanced data privacy
Sports Commentary AI OpenCV, GPT-4, TTS, Edge Computing Live cricket commentary, sub-second latency
Urdu NLP Research Gemma, PEFT/QLoRA, Custom TTS Fine-tuned for 70M+ Urdu speakers, targeting ACL/EMNLP 2025

πŸ“Š GitHub Stats

GitHub Stats Top Languages

πŸ”₯ 2025 Contribution Streak

Streak


πŸŽ“ Education & Certifications

πŸŽ“ BS Software Engineering β€” University of Karachi (UBIT) Β· 2020–2024

πŸ“œ Certifications:

  • LLMOps Β· Agentic RAG with LlamaIndex Β· Pretraining LLMs Β· Prompt Engineering Β· Building Systems with ChatGPT β€” DeepLearning.AI
  • Introduction to Generative AI β€” Google Cloud
  • Machine Learning Β· Deep Learning Β· Notebook Expert β€” Kaggle
  • n8n Automation β€” Simplilearn
  • Intermediate Python β€” DataCamp

πŸ”¬ Research (In Progress)

  • πŸ“„ Urdu TTS Architectures β€” Novel approaches to natural-sounding speech synthesis for low-resource languages
  • πŸ“„ Efficient LLM Adaptation for Low-Resource Languages β€” PEFT/QLoRA techniques for Urdu and similar languages

Targeting ACL / EMNLP 2025


πŸ’‘ Open to AI Engineer roles Β· LLM Systems Β· Voice AI Β· Applied Research

Buy Me a Coffee

"Building the future with AI β€” one production system at a time."

Pinned Loading

  1. graphrag graphrag Public

    Open-source RAG system combining vector search with knowledge graphs. Use free Ollama locally or your favorite LLM API. 100% free, MIT License.

    Python

  2. code-review-agent code-review-agent Public

    An agentic AI-powered code reviewer that clones any repository, analyzes every file using advanced LLM reasoning, and generates structured feedback, refactored code, and optional pull requests β€” al…

    Python

  3. agentic-productivity-assistant agentic-productivity-assistant Public

    An agentic AI productivity assistant that triages emails, schedules events, extracts tasks, and delivers daily briefs via Slack.

    Python

  4. autonomous-job-agent autonomous-job-agent Public

    Agentic AI that scrapes job boards, scores fit with Claude, and writes tailored cover letters autonomously

    Python 1

  5. Gpt4oImage Gpt4oImage Public

    Forked from OrenGrinker/Gpt4oImage

    This project is a Streamlit web application that leverages OpenAI's GPT-4o to generate descriptions for uploaded images

    Python

  6. RAG-Bot-with-Live-Agent-Support RAG-Bot-with-Live-Agent-Support Public

    An AI-powered Interactive RAG Bot with seamless live agent integration. Leverages Retrieval-Augmented Generation (RAG) for context-aware responses in web development, digital marketing, and more. F…

    Python