mmmm just a clarification here – we're not using the Assistants API and the query goes straight to OpenAI – the added latency is humanly indetectable (30-50ms).
we have heavy prompt caching – so if anything using botpress vs OpenAI directly just means cost savings and a fair chance at getting instant results back