I agree. Performance is poor. its 5+ seconds for every answer and for initial welcome message. I would happily pay more for the performance to be no more than 2 seconds per answer. GPT 3.5 is instantaneous, so I'm not sure where the bottle neck is but it's making me seriously consider looking elsewhere.