
Two major ones: you now have to handle all the classic cache problems like invalidation (what happens when you want to upgrade or the model improves?), and you also have to think about security. Given the drastic timing difference between a cache hit and a fresh generation, anyone can probe your cache to figure out what calls have been made and extract anything in the prompts, like passwords or PII (e.g. by going token by token and trying the top possibilities each time).
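To make that probe concrete, here's a rough sketch. It assumes the cache matches on shared prompt prefixes (an exact-match cache isn't probeable this way), and `query` stands in for whatever call hits the cached endpoint:

    import time
    from typing import Callable

    CACHE_HIT_THRESHOLD = 0.2  # seconds; tune to the observed hit/miss gap

    def is_cached(query: Callable[[str], str], prompt: str) -> bool:
        # Guess whether `prompt` is already cached: hits come back in
        # milliseconds, fresh generations take seconds.
        start = time.monotonic()
        query(prompt)
        return time.monotonic() - start < CACHE_HIT_THRESHOLD

    def extend_prefix(query: Callable[[str], str], prefix: str,
                      candidates: list[str]) -> str | None:
        # One step of the token-by-token attack: try each plausible
        # next token and keep whichever extension is still a cache hit.
        # Note a miss populates the cache itself, so probe each string once.
        for token in candidates:
            if is_cached(query, prefix + token):
                return prefix + token
        return None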


> happens when you want to upgrade or the model improves

I was thinking of prefixing the key of each query with model_gpt3.5-1000, or whatever model produced the answer.
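
Something like this sketch of what I had in mind (hashing the prompt so the key stays a fixed length is just one option):

    import hashlib

    def cache_key(model: str, prompt: str) -> str:
        # Namespacing by model means an upgrade automatically misses
        # old entries instead of serving stale answers.
        digest = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
        return f"{model}:{digest}"

    # cache_key("model_gpt3.5-1000", "How many visitors last week?")
    # -> "model_gpt3.5-1000:<sha256 hex>"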

> anyone can probe your cache to figure out what calls have been made

My use case is local-only[0], where each user sends requests from their own machine. I could cache by default, show an indicator that the answer came from the cache, and add a "force regenerate answer" button next to it.

[0]: https://docs.uxwizz.com/guides/ask-ai-new
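
Rough sketch of that cache-by-default flow (`ask_model` is a stand-in for the real LLM call):

    import hashlib
    from typing import Callable

    cache: dict[str, str] = {}

    def ask(ask_model: Callable[[str], str], model: str, prompt: str,
            force: bool = False) -> tuple[str, bool]:
        # Returns (answer, from_cache) so the UI can show the cached
        # indicator next to the "force regenerate answer" button.
        key = f"{model}:{hashlib.sha256(prompt.encode()).hexdigest()}"
        if not force and key in cache:
            return cache[key], True
        answer = ask_model(prompt)
        cache[key] = answer
        return answer, False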
