Your daily dose of complex AI concepts made simple, practical, and accessible for everyone.
blogpost
LLM-as-Judge: How to Evaluate AI Models Automatically at Scale
Read post →
blogpost
Edge AI: Running LLMs on Your Phone Without the Cloud
Read post →
blogpost
AI Coding Assistants in 2026: Cursor, GitHub Copilot, and the Future of Software Development
Read post →Ask which post fits your problem, or anything about Peri's work on LLMs, RAG, and agents. It'll link you straight to the article.
blogpost
Prompt engineering is giving way to something deeper: context engineering. How you structure what goes into the context window, what you include,...
ai-engineering
Vision Language Models bridge the gap between pixels and language. This post covers how CLIP encodes images, how visual tokens are projected...
ai-engineering
Mixture of Experts scales model capacity without scaling compute. Instead of activating all parameters for every token, MoE routes each token to...
blogpost
Diffusion models generate images by learning to reverse a noise process. No math required. Here is the intuition behind Stable Diffusion, DALL-E,...
blogpost
Stateless LLMs forget everything when the context window closes. Building agents that actually remember requires understanding four distinct memory types and when...
blogpost
Reasoning models like o1, o3, and DeepSeek R1 don't just predict the next token. They think through problems step by step before...
ai-engineering
Full fine-tuning a 7B parameter model requires 112 GB of VRAM and costs thousands of dollars. LoRA reduces trainable parameters by 99%...
ai-engineering
Standard vector RAG fails on multi-hop questions that require connecting information across multiple documents. Graph RAG solves this by building a knowledge...
ai-engineering
Standard unit tests cannot evaluate LLM outputs because the same question can have dozens of correct answers. This post covers LLM-as-judge scoring,...
ai-engineering
Exact-match caching fails for LLMs because users never ask the same question twice in identical words. Semantic caching solves this by comparing...