HUGE BILLING bug. Seems like a scam
# 🤝help
g
I will add an explanation in the topic
Guys, I have a huge problem with billing
Specifically the OpenAI payment. I am being charged very large sums for dialogue. A small dialogue with the bot costs me $0.4.
After publishing my bot on Telegram there were a few dialogues, and it cost me 4.4 dollars! If I have 10,000 typical dialogues, I would need to pay $4k to OpenAI alone.
Also, I noticed that after changing the model in the AI Task there is no difference in token pricing. I made some calculations.
It was a per-token price test for GPT-3.5 Turbo versions 1 and 2, and GPT-4 Turbo versions 1 and 2.
THE LOGS ARE EVIDENCE THAT THERE IS SOME BUG

FIRST CASE: I asked a small question that is not in my KB, so the AI Task performed (GPT-3.5 Turbo and GPT-4 Turbo give the same price results).

GPT-3.5 test
- Question: "How are you"
- Answer: I'm doing great, thanks for asking! How can I help you today?

GPT-4 test
- Question: "How are you"
- Answer: I always enjoy chatting with you guys! I'm fine, thanks for asking 😊. Is there anything else I can help you with or do you have any other questions?

LOOK at the logs!!! I pay 95% of my money to the KB, which didn't even perform in this case.
Also look how many tokens were counted for this little query.
SECOND CASE: I asked a question that is in my KB. Here are the results.
I decided to ask my lovely ChatGPT to calculate how many tokens are in the given answer from the KB. There were only 107 tokens in the answer.
The summary, personality and translator agents alone took 2,347 tokens, which is 22 times more than the number of tokens in the answer.
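For anyone reproducing these numbers, a more reliable way to count tokens than asking ChatGPT is OpenAI's tiktoken tokenizer; a minimal sketch:

```python
# Count tokens the way OpenAI bills them, using the tiktoken tokenizer.
# pip install tiktoken
import tiktoken

def count_tokens(text: str, model: str = "gpt-3.5-turbo") -> int:
    """Return how many tokens `text` occupies for the given model."""
    encoding = tiktoken.encoding_for_model(model)
    return len(encoding.encode(text))

# Placeholder string; paste the actual KB answer here to verify the 107-token figure.
kb_answer = "..."
print(count_tokens(kb_answer))
```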
f
How big is your KB?
g
Little
f
In RAG, the user query is sent to a vector database, where the most similar pieces of information are retrieved and sent to the LLM along with the query. The number of tokens sent to the LLM can be significant depending on how many chunks of text are retrieved from the database. A lot of redundant text in your KB will ramp up the costs. Try to go through your KB and delete everything that is redundant. If you are lazy and smart, you can give your documents to ChatGPT and prompt it to optimize the text for RAG and delete all the redundant text.
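Roughly what that looks like in code (a simplified sketch, not the actual Botpress implementation; `vector_db`, `llm` and the chunk sizes are stand-ins):

```python
# Simplified RAG flow: embed the query, fetch the most similar chunks, and send
# query + chunks to the LLM. Every retrieved chunk adds billable input tokens,
# so redundant text in the KB directly inflates the cost.
# `vector_db` and `llm` are stand-ins, not the actual Botpress objects.

def answer_with_rag(query: str, vector_db, llm, top_k: int = 5) -> str:
    chunks = vector_db.similarity_search(query, k=top_k)      # e.g. 5 chunks of ~300 tokens
    context = "\n\n".join(chunk.text for chunk in chunks)
    prompt = (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )
    # Billed input tokens ≈ tokens(prompt): the leaner the chunks, the cheaper the call.
    return llm.complete(prompt)
```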
g
If the problem is with database optimisation, why is the price many times lower than in the cases where the database is not used? And why is there no difference in pricing between the different GPT versions?
f
There is a big difference in pricing between GPT-3.5 and GPT-4. GPT-4 is ≈20 times more expensive than GPT-3.5.
You can see that the most expensive part is the KB agent
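To put rough numbers on the ≈20x gap (the per-1k-token prices below are assumptions for illustration, not an official price list):

```python
# Back-of-the-envelope comparison for one KB call of ~2,500 input tokens.
# Prices are assumed per-1k-input-token rates used only to illustrate the ~20x gap.
PRICE_PER_1K_INPUT = {"gpt-3.5-turbo": 0.0005, "gpt-4-turbo": 0.01}  # USD (assumed)

input_tokens = 2500
for model, price in PRICE_PER_1K_INPUT.items():
    print(f"{model}: ${input_tokens / 1000 * price:.4f} per call")
# Prints roughly $0.0013 for GPT-3.5 and $0.0250 for GPT-4 per call.
```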
g
Please, can you review my messages again?
There is no difference between the two versions.
f
Even if you query the KB and it is not able to answer, it will still have a cost.
g
And here, where there was no answer in the database, I was charged $0.024 for the second database query.
In this case it was only $0.0032.
f
Have you selected Hybrid in your KB agent?
g
Yes
f
It used GPT-3.5 first, which is the first cost, then it used GPT-4, which is the second cost.
GPT-4 is so much more expensive than GPT-3.5.
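So a Hybrid query can produce two separate charges, roughly like this (a guess at the flow with hypothetical helper callables, not the real implementation):

```python
# Guess at how a "Hybrid" KB query ends up with two charges: the cheap model answers
# first, and the expensive one is only called when that answer isn't good enough.
# `ask` and `is_good_enough` are hypothetical callables, not a real Botpress API.

def hybrid_answer(query: str, context: str, ask, is_good_enough):
    charges = []
    draft, cost = ask("gpt-3.5-turbo", query, context)    # first (cheap) charge
    charges.append(cost)
    if not is_good_enough(draft):
        draft, cost = ask("gpt-4-turbo", query, context)  # second, ~20x pricier charge
        charges.append(cost)
    return draft, sum(charges)
```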
g
Thanks for response
But why does it charge 2,347 tokens for the summary, personality and translator agents if the text is only 107 tokens?
Thanks for this advice
f
I can't tell you exactly what is going on behind the scenes, but my guess is that each agent's token cost works like this: it receives a message of x tokens, uses a prompt of y tokens, and outputs z tokens. The total token cost is therefore x + y + z, and you can only see the total. It is kind of like a black box for us, since we can't really see what is going on except the input and output. This post has been sent to the team, and when they are back in the office on Monday they can take a look and hopefully give you more information about this than I can.
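As a concrete example of that x + y + z accounting (the per-agent splits are made up to match the scale seen in the logs):

```python
# Per-agent billing modelled as x (incoming message) + y (hidden system prompt)
# + z (output). The x/y/z splits below are invented so the totals land near the
# observed ~2,347 tokens; the real breakdown isn't visible from outside.
agents = {
    "summary":     {"x": 350, "y": 600, "z": 150},
    "personality": {"x": 200, "y": 500, "z": 120},
    "translator":  {"x": 150, "y": 200, "z": 77},
}

total = sum(a["x"] + a["y"] + a["z"] for a in agents.values())
print(total)  # 2347 -- a 107-token answer can still carry far larger hidden prompt costs
```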
g
Thank you very much for the reply, I look forward to hearing back from the team
Sorry to bother you again, but after your advice on optimising the KB with ChatGPT, the database started using 1k more tokens for analysis.
3,500 tokens instead of 2,450.
f
Do you think that it was optimised? Did it reduce the amount of text in the KB?
g
Yes
In any case, my bot has a very small database, but the spend is very large. What happens when I have a bot with a big database?
My database is only 500kb from 100mb
f
It used more tokens with the exact same query?
g
Yes
I tried different queries
I will give you my document and the optimized one.
f
I might be thinking of two different posts, but didn't you say that your KB was only 0.5 MB earlier? Or am I thinking of another help post?
Yea here
g
Yes
It is only 0.5 MB, that is, 500 KB.
f
So it is not 500kb down from 100mb, it is just 500kb?
g
Yes, my database is only 500kb.
And it takes 2,500 tokens to review it with GPT-3.5.
And as I understand it, if my KB is 50mb, each call to my database will take 250k tokens, which is a minimum of 2 dollars.
@fresh-fireman-491
f
No. Here is a really simplified picture of how RAG works.
The query will remain the same number of tokens. The amount of context given by the database will also be the same. The number of tokens used will not increase a lot, since the LLM will process roughly the same amount of tokens. It will have to search in a larger DB, but that shouldn't ramp up the costs by 100x.
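In other words, retrieval pulls back a fixed number of chunks no matter how big the KB is, so the prompt sent to the LLM stays roughly the same size (again a simplified sketch with made-up numbers):

```python
# Why token usage barely scales with KB size: retrieval always returns at most
# top_k chunks, so the prompt the LLM actually sees stays about the same.
# All numbers are illustrative.
TOP_K = 5
TOKENS_PER_CHUNK = 300    # driven by your chunking settings, not total KB size
QUERY_TOKENS = 20
PROMPT_OVERHEAD = 200     # instructions wrapped around the retrieved context

def llm_input_tokens(total_chunks_in_db: int) -> int:
    retrieved = min(TOP_K, total_chunks_in_db)
    return QUERY_TOKENS + PROMPT_OVERHEAD + retrieved * TOKENS_PER_CHUNK

print(llm_input_tokens(10))          # tiny KB  -> 1720
print(llm_input_tokens(1_000_000))   # huge KB  -> 1720 (only the search space grew)
```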
g
Thanks for your professional explanation
f
You are very welcome, I am testing it right now to make sure I am not wrong!
g
Please, provide me with test results
f
With a KB of just 587 bytes.
With a KB of 59,336,422 bytes (59.3 MB).
So about 4 times more in token usage.
And the KB is about 101,084 times larger.
So, as stated, there will be an increase in token usage the larger the KB is, but it will not go up by 100x just because the KB size does.
With a KB of 81,054,142 bytes (81 MB).
Basically the same amount of tokens used between the 59 MB and 81 MB KBs, even though the new KB is around 1.37 times bigger.
g
Thanks
Can I ask you for a KB template?
In order to figure out why my KB is so heavy.
And in your experience, approximately how much money is made from an average conversation between a client and a bot?
f
Hey, I won't be able to share a template with you until much later today. I would recommend watching a YouTube video about it instead. It is 6 months old, so some things have changed, but the principles are the same.

https://youtu.be/lWihOWJRMaM?si=Mp_RQhodGE9d74v5

The amount of money earned varies a lot. Some bots don't even directly earn money; they are just there to make the customer happier, while some might earn a lot. I am not the best at giving business advice, so I would recommend asking those types of questions in #1174812513636987042
g
Thanks for your response. You really helped me. However, if it is possible, I would really like to look at your bot with the 50 MB knowledge base.
Appreciate your time
f
I just used botpress.com as the KB.
g
Thank you so much
f
You are very welcome
g
Guys, it’s still an open question.
f
Hey, could you give a recap of what you are still missing an answer to?
g
How can I decrease the number of tokens used by a KB query?
r
Wow, I see. I'm not alone with this problem of fast AI service consumption 😉 You can check out what we discussed in this post: https://discord.com/channels/1108396290624213082/1204990801265037362
f
Make your KB more concise.
h
I also have the same problem with GPT-4. I created a test bot with a single 37-page, 1.22 MB PDF. The cost for each single response is $0.16. I can't understand how this cost can be brought down, considering that I can't optimize the PDF in any way.