Discussion about this post

User's avatar
Neural Foundry's avatar

Excellent roundup, especially the Vinted dense retrieval piece. The index sharding-by-market strategy they disscuss is clever because it sidesteps the global coordination overhead thatmost dense retrieval implementations trip over at scale. Geographic partitioning feels obvious in hindsight but combining that with their reciprocal rank fusion in the global phase is what really makes the whole setup work under tight latency budgets.

No posts

Ready for more?