Technical Blog
In-depth articles on AI systems, software engineering, web development, and developer productivity
The Hidden Indexing Trap: LangChain-Postgres, pgvector, and Custom Vector Stores
How a 20,000-example few-shot similarity pipeline exposed silent full-table scans in pgvector, and what it took to fix them. Covers missing vector_size, operator class mismatches, HNSW migrations with Alembic, iterative scans, quantization, and production observability.
A Cognitive-Science Approach to AI-Assisted Coding
Aligning AI-assisted code generation with human reasoning: Deep Work, atomic changes, and protocol-driven prompting to improve productivity and code quality.
Prompt Design Patterns in Agentic AI: A Comprehensive Technical Guide
A technical guide to prompt design patterns for building production-ready agentic AI systems. Covers core patterns like ReAct and chain-of-thought, advanced patterns, and evaluation strategies.
Deploying AI Systems in Hospitals: From Lambda Function to SaMD
A technical guide to moving AI from prototype to regulated, production-grade hospital systems: serverless limits, on-prem deployment, SaMD, and GDPR/HIPAA alignment.
Scaling XGBoost: Training Classification Models with Thousands of Classes
How to fix XGBoost out-of-memory errors when training multi-class models with thousands of classes on GPUs. Covers the gradient-pair matrix, three strategies (CPU, One-vs-Rest, Hierarchical Softmax), and when to use each.