Thank you! We have built out the cache system -- we do both simple caching (an exact match on the request string) and semantic caching (returning a cache hit for semantically similar requests). More here - https://portkey.ai/docs/product/ai-gateway-streamline-llm-in...
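For anyone curious, a minimal sketch of how those two modes can fit together: an exact-match lookup first, then a fallback nearest-neighbor lookup over embeddings with a similarity threshold. The bag-of-words "embedding", the `Cache` class, and the threshold value are all illustrative stand-ins, not Portkey's actual implementation:

```python
import hashlib
import math
from collections import Counter


def toy_embed(text: str) -> Counter:
    # Stand-in for a real embedding model: bag-of-words token counts.
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


class Cache:
    def __init__(self, threshold: float = 0.9):
        self.exact = {}       # request hash -> response
        self.semantic = []    # (embedding, response) pairs
        self.threshold = threshold

    def get(self, request: str):
        # Simple cache: exact match on the request string.
        key = hashlib.sha256(request.encode()).hexdigest()
        if key in self.exact:
            return self.exact[key]
        # Semantic cache: closest stored request above a similarity threshold.
        emb = toy_embed(request)
        best = max(self.semantic, key=lambda e: cosine(emb, e[0]), default=None)
        if best and cosine(emb, best[0]) >= self.threshold:
            return best[1]
        return None  # cache miss

    def put(self, request: str, response: str):
        key = hashlib.sha256(request.encode()).hexdigest()
        self.exact[key] = response
        self.semantic.append((toy_embed(request), response))
```

A real gateway would use a proper embedding model and a vector index instead of a linear scan, but the lookup order (exact first, semantic fallback) is the same idea.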
The caching part isn't open source yet; it lives in our internal workers. Would be very cool to open source it!