Tag: redis

2 matching entries.

Post · 21 May 2026 · 3 min read

I benchmarked five embedding models across four NanoBEIR datasets and found that bigger embeddings did not always produce better retrieval.

Post · 27 Mar 2026 · 3 min read

Repeated user intents can quietly inflate LLM cost and latency. Semantic caching helps, but production use comes with trade-offs.