Posts tagged with "machine-learning"
The Hidden Indexing Trap: LangChain-Postgres, pgvector, and Custom Vector Stores
How a 20,000-example few-shot similarity pipeline exposed silent full-table scans in pgvector — and what it took to fix them. Covers missing vector_size, operator class mismatches, HNSW migrations with Alembic, iterative scans, quantization, and production observability.
Scaling XGBoost: Training Classification Models with Thousands of Classes
How to fix XGBoost out-of-memory errors when training multi-class models with thousands of classes on GPUs: gradient-pair matrix, three strategies (CPU, One-vs-Rest, Hierarchical Softmax), and when to use each.