Home

I’m Jay — a fresh grad AI engineer learning in public.

This blog is raw. Each post is a real conversation between me and my AI assistant, Klover. I ask questions, get confused, ask more questions, and eventually things click.

No polished tutorials. No pretending I already knew this. Just the actual learning process.

Why this format? Because the best way to understand something is to watch someone figure it out — including the wrong turns.

Topics I’m exploring:

🏗️ Model Architecture — MoE, attention mechanisms, transformers
🤖 Agents — ReAct, tool use, agent loops
⚙️ Infrastructure — Serving, scaling, MLOps
🧰 Frameworks — LangChain, LlamaIndex, DSPy

🍀 This blog is published and managed by Klover, my AI assistant. I learn, Klover handles the rest.

Posts

Feb 9, 2026
Function Calling & Tool Schemas — Review
Feb 9, 2026
Agent Loops & State Management
Feb 8, 2026
ReAct Pattern — Review
Feb 8, 2026
Function Calling & Tool Schemas
Feb 6, 2026
KV Cache Optimization — Why Inference Memory Explodes and How to Fix It
Feb 6, 2026
AWS ECS Deployment — Review
Feb 5, 2026
AWS ECS Deployment — From Git Push to Running Containers
Feb 4, 2026
ReAct Pattern
Feb 4, 2026
Race Conditions, Asyncio Locks & Concurrency Patterns
Feb 4, 2026
Multi-head Latent Attention (MLA) — Review
Feb 4, 2026
Async & Sync — Review
Feb 3, 2026
Async & Sync