Ayaan Khan

Applied AI & ML

AI engineer and backend developer building production LLM systems, including retrieval augmented generation, semantic search, and multi-stage agent workflows.

Chicago, IL

[email protected]

Building
Clutch (staging)

AI course generation platform

SyntecAI (deployed)

Enterprise RAG platform

Internal Agent

Automating firm workflows

This site

continuously refining

Using

Hardware

MacBook Pro M4 Pro

iPhone 17 Pro

Samsung Z Fold 7

Software

VS Code

Claude Code

Codex

Gemini

Ghostty

GarageBand

Watching

Currently

Industry

Show · 2024

Past Watched

Silicon Valley

Severance

Succession

About Me

Education

B.S. in Artificial Intelligence

Minor in Architecture

Illinois Institute of Technology

Relevant Coursework

DSAAIMLNLPDBMSAssemblyData MiningDiscrete MathLinear AlgebraProbabilityStatisticsOOP

Chicago, Illinois

Expected May 2026

Experience

AI & Digital Development Intern

The Syntec Group

Chicago, Illinois

May 2025 – Present

  • Built and deployed a semantic RAG chatbot on Chatbase over firm documents, delivering cited, context-grounded answers to reduce lookup time and improve response consistency.
  • Developed an internal agentic system using OpenAI function calling to manage building module codes through natural language, with confirmation flows for destructive operations and ChromaDB sync for semantic search.
  • Implemented ingestion and retrieval workflow (chunking, embeddings, indexing) across PDFs, CSVs, website pages, and WordPress blog posts, with embedding caching via Redis that reduced inference cost by approximately 65%.
  • Led an information architecture plus website redesign improving navigation and access to resources; used engagement analysis to iterate content performance.

Remote (India)

May 2021 – July 2021

  • Created and managed a 24/7 helpline during India’s second COVID wave, connecting thousands of patients with critical resources like oxygen, beds, and medicines.

Project Manager

Excelerate (Globalshala)

New Delhi, India

June 2023 – July 2023

  • Led a global team to organize an academic event with a $30,000 budget, managing documentation, risk assessment, and external outsourcing.

Projects

Clutch (staging)

  • Built a staging-deployed SaaS that generates research-backed courses using a multi-stage agent pipeline, orchestrating 5 stages with persisted job state, retry policies, and failure isolation to keep long-running workflows reliable.
  • Implemented an asynchronous FastAPI backend with PostgreSQL plus pgvector and Redis caching, and streamed real-time job progress to clients using SSE backed by Redis to keep generation workflows responsive.
  • Cut inference cost and improved output quality using LiteLLM routing and shared source deduplication; enforced structured outputs with PydanticAI validation and added observability via Sentry and PostHog.

FastAPI, Postgres/pgvector, Redis, Inngest, SvelteKit, LiteLLM, PydanticAI, Docker, Sentry, PostHog

Syntec AI Chatbot (deployed)

  • Shipped a semantic search and retrieval augmented chatbot using GPT-4o and ChromaDB embeddings to deliver question answering across PDF documents, CSVs, blog posts, and website content with automated source citations.
  • Built an agentic system using OpenAI function calling that enables authorized users to read, write, and modify mastercode, unicode, and omnicode entries through natural language, with role-based access controls and confirmation flows for destructive operations.
  • Containerized the service with Docker Compose for consistent deployments, implemented a dual-LLM fallback strategy for cost optimization, and added security controls including input sanitization, rate limiting, and CORS restrictions.

React, Flask, ChromaDB, Redis, Docker, WordPress

InvestoChat

  • RAG system for real estate investment queries with multi-path retrieval (pgvector + SQL fallbacks with MMR), OCR processing, and automated table extraction from PDF brochures.
  • WhatsApp Business API lead qualification + Airtable CRM sync + commission tracking + rate limiting + PII detection.

FastAPI, Postgres/pgvector, OCR, WhatsApp Business API, Airtable

Trend Analyzer for Raw Materials

  • Cotton price forecasting using Facebook Prophet with external regressors (oil, gas, soybeans); evaluated with MAPE.
  • Multivariate regression analysis of correlated commodity markets; time-series cross validation.

Python, Prophet, Pandas, Matplotlib

Sports and Metrics Tracker

  • Built an end-to-end soccer video analysis pipeline using YOLOv8 detection and ByteTrack tracking, adding temporal smoothing for stable motion analytics and CPU only local processing with robust cross-platform video input and output.
  • Developed export and visualization tooling, including CSV and JSON metrics with track filtering (minimum 15 frames), heatmap rendering with percentile transforms, and tiered error handling with debug mode progress reporting.

Python, YOLOv8, ByteTrack, OpenCV

Systems Thinking

  1. 01. Data ingestion and cleaning
  2. 02. Retrieval strategy (vector + lexical + rerank)
  3. 03. Structured outputs + validation
  4. 04. Latency/cost tradeoffs + caching
  5. 05. Observability + failure modes