Building

An engineering chronicle. Projects exploring AI, systems, and the intersection of technology with human experience.

2025

Enterprise RAG System

Production RAG system with 89% accuracy. Built an 8-metric evaluation framework that includes BERTScore and RAGAS metrics. Achieved a 236% improvement in context precision. $7.8M in annual value created.

[Cohere, LangChain, RAGAS, ChromaDB]
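
For readers unfamiliar with context precision, here is a minimal sketch of how a rank-aware context precision score can be computed. It follows the commonly used definition (precision@k averaged over the ranks where a relevant chunk appears) and is not the project's evaluation code. The function names and the toy relevance lists are hypothetical, and the numbers are illustrative rather than the project's results.

```python
from typing import Sequence


def context_precision(relevance: Sequence[bool]) -> float:
    """Rank-aware precision over retrieved chunks.

    relevance[i] is True when the chunk at rank i + 1 is relevant.
    Precision@k is accumulated only at ranks holding a relevant chunk,
    then divided by the number of relevant chunks retrieved.
    """
    hits = 0
    weighted = 0.0
    for rank, is_relevant in enumerate(relevance, start=1):
        if is_relevant:
            hits += 1
            weighted += hits / rank  # precision@rank at a relevant position
    return weighted / hits if hits else 0.0


def relative_improvement(before: float, after: float) -> float:
    """Relative change, e.g. 1.0 means a 100% improvement."""
    return (after - before) / before


# Toy example (assumed data): same two relevant chunks, ranked lower vs. higher.
baseline = context_precision([False, True, False, True])
reranked = context_precision([True, True, False, False])
gain = relative_improvement(baseline, reranked)
```

Moving relevant chunks toward the top of the retrieved list raises the score even when the set of retrieved chunks stays the same, which is why reranking tends to show up strongly in this metric.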

2025

FastSentiment API

ML inference with Redis caching. Sub-100ms response time, 95% cache hit rate. Production-grade with async processing and horizontal scaling.

[FastAPI, Redis, DistilBERT, Kubernetes]

→ github
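
A minimal sketch of the cache-aside pattern behind this kind of inference caching. To stay runnable without a server, an in-memory TTL store stands in for Redis and a keyword rule stands in for DistilBERT. `TTLCache`, `cached_predict`, `toy_model`, the key prefix, and the text normalization are all assumptions for illustration, not the project's code.

```python
import hashlib
import time
from typing import Callable, Dict, Optional, Tuple


class TTLCache:
    """In-memory stand-in for a Redis-style key/value store with expiry."""

    def __init__(self, ttl_seconds: float) -> None:
        self.ttl = ttl_seconds
        self._store: Dict[str, Tuple[float, str]] = {}
        self.hits = 0
        self.misses = 0

    def get(self, key: str) -> Optional[str]:
        entry = self._store.get(key)
        if entry is None or entry[0] < time.monotonic():
            self._store.pop(key, None)  # drop expired entries lazily
            return None
        return entry[1]

    def set(self, key: str, value: str) -> None:
        self._store[key] = (time.monotonic() + self.ttl, value)


def cached_predict(text: str, cache: TTLCache, model: Callable[[str], str]) -> str:
    """Cache-aside: look up first, run the model only on a miss."""
    normalized = text.strip().lower()  # assumed normalization step
    key = "sentiment:" + hashlib.sha256(normalized.encode("utf-8")).hexdigest()
    cached = cache.get(key)
    if cached is not None:
        cache.hits += 1
        return cached
    cache.misses += 1
    label = model(text)
    cache.set(key, label)
    return label


def toy_model(text: str) -> str:
    """Placeholder for the real classifier."""
    return "positive" if "good" in text.lower() else "negative"


cache = TTLCache(ttl_seconds=60.0)
results = [cached_predict(t, cache, toy_model) for t in ["Good film", "good film ", "Bad film"]]
hit_rate = cache.hits / (cache.hits + cache.misses)
```

Hashing the normalized text keeps keys short and uniform. Whether lowercasing is safe depends on the model: for a cased model it would merge inputs that can score differently.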

2025

AI Papers Research Agent

Automated research synthesis. Processes 100+ papers daily from arXiv. Extracts insights and generates personalized digests.

[Python, LangChain, arXiv API, GPT-4]

→ github

2024

Multimodal Chain-of-Thought with DPO

Research on enhancing multimodal reasoning through RLAIF and Direct Preference Optimization (DPO). Created the first public multimodal CoT preference dataset. Published findings on DPO limitations for vision-language models.

[LLaVA, DPO, RLAIF, PyTorch, HuggingFace]

→ dataset
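
For context, a minimal sketch of the standard DPO objective for a single preference pair, written with plain floats so it runs without PyTorch. The function name, argument names, and the example log-probabilities are hypothetical; this shows the published loss formula, not the training code used in this project.

```python
import math


def dpo_loss(
    policy_chosen_logp: float,
    policy_rejected_logp: float,
    ref_chosen_logp: float,
    ref_rejected_logp: float,
    beta: float = 0.1,
) -> float:
    """DPO loss for one (chosen, rejected) pair of sequence log-probabilities.

    loss = -log(sigmoid(beta * ((pi_c - ref_c) - (pi_r - ref_r))))
    """
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_logratio - rejected_logratio)
    # -log(sigmoid(m)) equals log(1 + exp(-m)); branch to avoid overflow.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Policy identical to the reference: no preference signal yet.
neutral = dpo_loss(-2.0, -2.0, -2.0, -2.0)
# Policy has shifted probability toward the chosen response (assumed values).
improved = dpo_loss(-1.0, -3.0, -2.0, -2.0)
```

The loss depends only on how far the policy has moved relative to the reference model on each response, so a policy that matches the reference always starts at log 2.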

2024

ML Systems Engineering

End-to-end deployment pipeline. Reduced model serving latency by 70%. Implemented caching, health checks, auto-scaling.

[Kubernetes, Docker, Redis, Prometheus]

2024

Traffic Sign Classification

Computer vision with 96% accuracy. Multi-class CNN with ResNet50 transfer learning. Real-time inference pipeline.

[TensorFlow, CNN, ResNet50, OpenCV]

2024

Image to Ingredients

Food recognition from images. 92% ingredient accuracy using GPT-4 Vision. Full-stack app with real-time processing.

[Next.js, TypeScript, GPT-4V, Vercel]

→ github → live

2024

Attention Mechanism Visualizer

Interactive education tool. Step-by-step visualization of transformer attention. Used by 200+ students.

[JavaScript, D3.js, WebGL, React]

→ github
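
The computation such a tool walks through, scaled dot-product attention, can be sketched in a few lines of plain Python. This is an illustrative version with hypothetical names and toy inputs, not the visualizer's JavaScript source.

```python
import math
from typing import List, Tuple

Matrix = List[List[float]]


def softmax(row: List[float]) -> List[float]:
    """Numerically stable softmax over one row of scores."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention(q: Matrix, k: Matrix, v: Matrix) -> Tuple[Matrix, Matrix]:
    """Return (weights, output) for softmax(Q K^T / sqrt(d_k)) V."""
    d_k = len(k[0])
    # Step 1: scaled dot product between every query and every key.
    scores = [
        [sum(a * b for a, b in zip(q_row, k_row)) / math.sqrt(d_k) for k_row in k]
        for q_row in q
    ]
    # Step 2: normalize each query's scores into attention weights.
    weights = [softmax(row) for row in scores]
    # Step 3: weighted sum of value vectors for each query.
    output = [
        [sum(w * v_row[j] for w, v_row in zip(w_row, v)) for j in range(len(v[0]))]
        for w_row in weights
    ]
    return weights, output


# One query, two identical keys: both positions receive equal weight.
q = [[1.0, 0.0]]
k = [[1.0, 0.0], [1.0, 0.0]]
v = [[2.0, 0.0], [0.0, 2.0]]
weights, output = attention(q, k, v)
```

Returning the weights alongside the output is what makes each step inspectable: the weight matrix is the part a heatmap would typically display.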

2024

PEFT & LoRA Implementation

Parameter-efficient fine-tuning. Reduced memory use by 90% while retaining 98% of baseline performance. Comprehensive benchmarks across models.

[PyTorch, Transformers, PEFT, LoRA]

→ github
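
A rough sketch of the arithmetic behind LoRA's parameter savings for a single weight matrix. The layer size and rank are assumed example values, not this project's benchmark configuration, and the helper names are hypothetical. It counts trainable parameters only; actual memory savings also depend on optimizer state, activations, and which layers are adapted.

```python
def full_params(d_in: int, d_out: int) -> int:
    """Parameters in a dense d_out x d_in weight matrix."""
    return d_in * d_out


def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """LoRA trains two low-rank factors: B (d_out x r) and A (r x d_in)."""
    return rank * (d_in + d_out)


# Assumed example: one 4096 x 4096 projection adapted with rank 8.
d_in = d_out = 4096
rank = 8
full = full_params(d_in, d_out)                   # 16,777,216
lora = lora_trainable_params(d_in, d_out, rank)   # 65,536
fraction = lora / full                            # 0.00390625, about 0.39%
```

Because the frozen base weights need no gradients or optimizer state, shrinking the trainable set this way is the main source of the training-time memory reduction.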