MCP Discoverability: The Hidden Cost of Scale
As the agentic ecosystem matures, tools are no longer scarce. They're everywhere. The hard part now isn't wiring up tools — it's helping models discover which ones to use.
Engineer at heart • Exploring AI, robotics & the human experience
As the agentic ecosystem matures, tools are no longer scarce. They're everywhere. The hard part now isn't wiring up tools — it's helping models discover which ones to use.
In the past year, agent architectures have gone from niche experiments to front-page product strategies. But one area remains dramatically under-discussed: context engineering.
Evaluation has quietly become the backbone of modern AI products. It's what separates a system that 'looks cool in demos' from one that actually works.
I spent 1.5 hours debugging what should've been a simple Postgres connection issue while using Google's Gemini CLI. Claude solved it in 5 minutes.
Over the next 10 years, the GenAI landscape won't be shaped by prompt hacks or viral demos. It will be defined by who builds the infrastructure, systems, safety nets, and experiences that actually ship and scale.
I've been building with AI coding agents lately — and they're surprisingly good. But one thing's become clear: you still have to say no. A lot.
Lately, I've been experimenting with Cursor's AI coding agent and had a pretty fascinating 'vibe coding' session — part exploration, part debugging, part babysitting.
Over the past few days, I've been exploring how to design intelligent agents using large language models. What started as a simple weather bot turned into a deep dive.
I recently used Claude as a coding partner for a project predicting patient appointment show rates. It was impressive in some ways — but frustrating in others.
A deep dive into how companies are actually using large language models in production, from GitHub Copilot writing 46% of code to enterprises struggling with hallucination rates of 27%
An in-depth analysis of the LLM ecosystem in May 2023, from Geoffrey Hinton's dramatic Google exit to the $50 billion funding frenzy reshaping Silicon Valley's power structure
This is the first of a multi-part series exploring exciting new developments in AI. A deep dive into the models that power ChatGPT.