Talal Ahmed

AI & Backend Engineer

I'm Talal Ahmed, an AI & backend engineer.

I build LLM systems, RAG pipelines, and the scalable backends behind them.

Experience

  1. May 2025 — June 2026

    Effica

    · AI Engineer (Contract)

    Remote · Canada

    • Built a serverless document ingestion pipeline on AWS Lambda for large health documents and research articles, using a two-stage parser for structured extraction and pgvector for semantic search.
    • Developed an end-to-end health report generation system from doctor–patient calls: audio → transcription → LLM/NLP pipelines producing multiple clinical and practitioner-ready report types.
    • Built prompt-configurable report workflows letting practitioners dynamically tune report structure, tone, and medical detail.
  2. May 2025 — Aug 2025

    Metal

    · AI Backend Engineer (Contract)

    Remote · Singapore

    • Built a network-discovery system that analyzes LinkedIn/Gmail connections to find intro paths to investors, backed by SQL queries combining 15+ filters.
    • Built an AI-enhanced investor search engine using pgvector embeddings; cut PostgreSQL query latency on 100K+ profiles from 500ms to under 50ms.
    • Led the SuperTokens authentication migration to the latest versions.
  3. Feb 2024 — Feb 2025

    Mosaik

    · AI Engineer

    Remote · United States

    • Fine-tuned open-source LLMs (Phi, Llama) for query classification with LLaMA-Factory.
    • Built a RAG-based chatbot (FastAPI) over client property documents and a Text-to-SQL system using RAG + LangChain + GPT.
    • Built an end-to-end image auto-labeling pipeline with GroundingDINO and AutoDistill.
  4. Mar 2023 — Jan 2024

    Beam

    · ML / Backend Engineer

    Remote · Germany

    • Built semantic routers using GPT for intent classification and fine-tuned a BERT model for query classification.
    • Developed an AI backend with LangChain, RAG, and FastAPI; designed a RAG pipeline for dynamic tool identification.
    • Engineered agent workflows in n8n connecting Slack, Gmail, and Google Docs; set up GitHub Actions for automated AWS deployment.
  5. Jan 2023 — Mar 2023

    Liquid Technologies

    · ML Engineer
    • Trained computer-vision models with YOLOv5/YOLOv7 and Roboflow, supported by image-processing techniques.
  6. Sep 2022 — Dec 2022

    University of Karachi

    · Research Assistant
    • Worked with a professor on deep neural networks for medical image segmentation, improving MRI analysis.

Projects

Felixai

Event-driven platform that syncs and processes Canvas LMS content. Apache Kafka pipelines handle large-scale ingestion, OCR, and AI workflows; Server-Sent Events report real-time sync progress; tuned for I/O-bound workloads.

AI Calling Agent

Outbound voice-calling agent built with Vapi. Real-time STT/TTS pipelines for natural dialogue, with dynamic prompt logic and external tool integration for scenario-based workflows.

HailoChat

Multi-tenant Zendesk-style support SaaS. Scalable WebSocket chat with Redis-backed sessions, hybrid PostgreSQL + MongoDB storage, and subscription-based access control.

VecLite

Efficient vector store built on a K-means tree over SQLite. Supports cluster-based search, full scans, and random-projection dimensionality reduction for fast retrieval.
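
For readers unfamiliar with the technique, here is a minimal in-memory sketch of the idea, not VecLite's actual code. It assumes a single level of k-means clusters rather than a full tree, keeps everything in Python lists instead of SQLite, and uses hypothetical function names (`make_projection`, `cluster_search`, `full_scan`) chosen only for illustration.

```python
import math
import random


def make_projection(dim_in, dim_out, seed=0):
    """Gaussian random projection: dim_out rows, each of length dim_in."""
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(dim_out)
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(dim_in)]
            for _ in range(dim_out)]


def project(vec, matrix):
    """Reduce a vector to len(matrix) dimensions."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]


def nearest(point, centroids):
    """Index of the centroid closest to point."""
    return min(range(len(centroids)),
               key=lambda c: math.dist(point, centroids[c]))


def kmeans(points, k, iters=10, seed=0):
    """Plain Lloyd's algorithm; returns centroids and a label per point."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        buckets = [[] for _ in range(k)]
        for p in points:
            buckets[nearest(p, centroids)].append(p)
        for c, members in enumerate(buckets):
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = [sum(col) / len(members)
                                for col in zip(*members)]
    labels = [nearest(p, centroids) for p in points]
    return centroids, labels


def cluster_search(query, points, centroids, labels, top_k=3, n_probe=1):
    """Scan only the n_probe clusters whose centroids are closest to query."""
    ranked = sorted(range(len(centroids)),
                    key=lambda c: math.dist(query, centroids[c]))
    probe = set(ranked[:n_probe])
    candidates = [i for i, lab in enumerate(labels) if lab in probe]
    return sorted(candidates,
                  key=lambda i: math.dist(query, points[i]))[:top_k]


def full_scan(query, points, top_k=3):
    """Exact search: compare the query against every stored vector."""
    return sorted(range(len(points)),
                  key=lambda i: math.dist(query, points[i]))[:top_k]


# Toy data: two blobs of 16-dimensional vectors, reduced to 4 dimensions.
rng = random.Random(42)
raw = [[rng.gauss(center, 0.1) for _ in range(16)]
       for center in (0.0, 5.0) for _ in range(50)]
matrix = make_projection(16, 4)
points = [project(v, matrix) for v in raw]
centroids, labels = kmeans(points, k=2)

approx = cluster_search(points[7], points, centroids, labels, n_probe=1)
exact = full_scan(points[7], points)
```

Probing fewer clusters trades recall for speed; probing every cluster reduces to the full scan.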

Email Intelligence System

FastAPI + Azure OpenAI service that extracts structured data (order/PO/registration numbers) from .eml files and attachments, cross-verified against a PostgreSQL inventory DB.

Majai Platform

AI ad-automation for Facebook/Google Ads. Content generation backend with image/video models, Facebook Graph API auto-posting, ESRGAN enhancement, and an OpenCV video pipeline.

Pinnacle Bot

RAG-based document tool that processes 100- to 700-page PDFs and generates structured documents for civil engineers, with extracted insights and reference links.

AutoGrad Engine

A from-scratch automatic differentiation engine in C++ for efficient gradient computation.
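
As a rough illustration of what such an engine does, the sketch below shows reverse-mode automatic differentiation on scalars. It is written in Python for brevity and is not the project's C++ code; the `Value` class, its fields, and the choice of reverse mode are assumptions made for this example.

```python
class Value:
    """A scalar that records how it was computed so gradients can flow back."""

    def __init__(self, data, parents=()):
        self.data = float(data)
        self.grad = 0.0
        self._parents = parents
        self._backward = lambda: None  # leaf nodes have nothing to propagate

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))

        def _backward():
            # d(a + b)/da = 1 and d(a + b)/db = 1
            self.grad += out.grad
            other.grad += out.grad

        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))

        def _backward():
            # d(a * b)/da = b and d(a * b)/db = a
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad

        out._backward = _backward
        return out

    def backward(self):
        """Apply the chain rule from this node back to every input."""
        order, seen = [], set()

        def visit(node):
            if node not in seen:
                seen.add(node)
                for parent in node._parents:
                    visit(parent)
                order.append(node)  # parents are appended before children

        visit(self)
        self.grad = 1.0
        for node in reversed(order):
            node._backward()


x = Value(2.0)
y = Value(3.0)
z = x * y + x   # z = 8
z.backward()    # dz/dx = y + 1 = 4, dz/dy = x = 2
```

Each operation stores a small closure holding its local derivative, and `backward` walks the graph in reverse topological order so every node's gradient is complete before it is passed on to its parents.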

Skills