How a 20,000-example few-shot similarity pipeline exposed silent full-table scans in pgvector — and what it took to fix them. Covers missing vector_size, operator class mismatches, HNSW migrations with Alembic, iterative scans, quantization, and production observability.
Aligning AI-assisted code generation with human reasoning: Deep Work, atomic changes, and protocol-driven prompting to improve productivity and code quality.
A technical guide to prompt design patterns for building production-ready agentic AI systems. Covers core patterns like ReAct and chain-of-thought, advanced patterns, and evaluation strategies.
A technical guide to moving AI from prototype to regulated, production-grade hospital systems: serverless limits, on‑prem deployment, SaMD, and GDPR/HIPAA alignment.
How to fix XGBoost out-of-memory errors when training multi-class models with thousands of classes on GPUs: gradient-pair matrix, three strategies (CPU, One-vs-Rest, Hierarchical Softmax), and when to use each.
mkavie.tech - Technical Blog on AI, Software Engineering & Web Development