Please help! Our AI spending is too high!
# 🤝help
f
Hey there, it will be lower if you use GPT-3.5; it's around 30x cheaper
Maybe we can separate these posts? This post can be about AI spending, and you can make a new post about the KB query issue
r
Done
We wanted to cover all types of situations, but it looks like we need to use fewer values
f
Why not just include the answers to all of those in the KB, instead of there?
r
Because some of them route to other nodes that handle these types of situations
Like if someone says he wants to cancel in different ways, it will send him where he needs to go
c
Can you post a bit more of your flow in higher resolution?
Generally it is good practice to minimize as much as possible within your flow.
This includes the options for the AI task node.
Instead of branching out to every possible solution with the AI expression node, you can try to "catch and parse" the response using an AI node, e.g.:
- if a user says "I would like to buy a product", set `workflow.intent = buy`
- if a user "wants to order a coffee", set `workflow.intent = coffee`
- then we can set an expression node that triggers the flow if it matches one of these outputs.
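The catch-and-parse pattern above can be sketched as a small routing helper. This is a minimal illustration, not Botpress's API: `parseIntent` and the keyword lists are hypothetical names, and the `workflow.intent` assignment is shown only as a comment because the surrounding Botpress context isn't available here.

```javascript
// Hypothetical "catch and parse" intent router.
// Instead of branching to every possible solution, map the AI node's
// free-text answer onto a small set of known intents, then let a single
// expression node route on workflow.intent.
const INTENT_KEYWORDS = {
  buy: ['buy', 'purchase', 'order a product'],
  coffee: ['coffee'],
  cancel: ['cancel', 'stop my subscription', 'unsubscribe'],
};

function parseIntent(aiResponse) {
  const text = aiResponse.toLowerCase();
  for (const [intent, keywords] of Object.entries(INTENT_KEYWORDS)) {
    if (keywords.some((kw) => text.includes(kw))) {
      return intent;
    }
  }
  return 'unknown'; // falls through to the default branch
}

// In an Execute card you might then set something like:
// workflow.intent = parseIntent(aiResponse);
```

The point of the design is that however many ways a user phrases "cancel", they all collapse into one `workflow.intent` value, so the flow only needs one branch per intent instead of one per phrasing.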
@User Can someone assist us?
f
I think Remo and I have given you enough to start changing some things, and trying those 🙂
f
Hey there, where can I check my AI spending?
f
Thanks 👍🏻
f
You are very welcome
r
@User @bumpy-butcher-41910 Can anyone here assist us? Each response now costs $0.10, even from the KB (all organized there as Q&A), and that's really a lot for a basic conversation...
The AI spend limit can be reached in only one conversation
We tried switching to GPT-3.5 in the KB, which brought the price down to $0.01, but then it can't answer anything from our KB and skips to our AI fallback as if nothing is there
Hi, the main cost is the knowledge search, not even the AI cards
It's $0.10 for each response
d
Please show me your KB. What type of files do you have?
r
Only a rich text file
and a website
homepage
f
How big is your rich text file?
And are you using the Website as your source or the Search the web?
d
Maybe it's a language issue, because I work with Polish and the AI spend for Polish is a little bit higher than for English
r
Tried both; search was cheaper
What are your prices for Polish? And what kind of model do you run?
Not too big, just the details we need to sell the product
f
So are you still using Search?
r
yes
what do you recommend?
f
I don't think I can do much more here unfortunately. I have given you the tips and tricks that I know.
b
just to confirm, it sounds like you're using GPT-4?
r
Now it's on GPT-3.5. On 4, all the prices go up to $0.10
b
if you've been using GPT-4 during your tests, that's just the cost of the model
you can learn more at https://openai.com/pricing
as I show in the video you linked above, it's possible to reduce the cost of your knowledge base queries using the tips we outline there
r
Hi Robert, thank you for your reply. We have been working on it for the past few days, trying to fix it. I would like to describe it; I hope you will know the answer.
• When we use 'Fast (GPT-3.5)': all our query knowledge base cards get ignored, or give bad responses. The price for each response is around $0.00X.
• When we use 'Hybrid (GPT-3.5/4)': it reads the KB, the query knowledge base cards work much better, and the bot knows how to answer and does it better. The price for each response is around $0.08 on GPT-4, and when it runs on GPT-3.5 the price for each response is around $0.00X.
• When we use 'Slow (GPT-4)': good responses, but the price for each response is $0.08 on GPT-4.
After these 2 days, Hybrid is the most accurate one, but the prices are very high: calculating a typical chat of 30 responses, it can cost around $1 per conversation.
All the tips here are very welcome, and big thanks for them, but as you can see @bumpy-butcher-41910 @fresh-fireman-491, the problem is not the AI classification or anything else. It's just the Query KB agent spending 10,000 tokens on small responses. It is a lot...
The whole KB is organized in one small rich text file with an FAQ and explanations. Also, @User, I am not this type of person, but I must say that we have been working on the Botpress platform for about a year, learning all the features and techniques, and we are huge fans looking forward to more. But each time we think it's perfect and it has been working for about a week, it pulls us back again and again, either by messing with our responses randomly or with these high conversation prices. We really don't know how to sell the bots to our customers until we figure out how to solve these things... And we see that a lot of bot builders are making it work, so why can't we? (I have just checked the Claude LLMs; the prices are much lower and they work great with other languages like Hebrew, etc. I think it could be an amazing option to add to Botpress.)
b
what is giving you the sense that the token usage here is higher than expected?
GPT-4 is just an expensive model, unfortunately
I typically don't have problems with KB queries on GPT 3.5
the ideal solution would be to use GPT 3.5, since it's about 30x cheaper than GPT-4
r
Because a lot of the time it costs $0.07 instead of $0.0043, which pushes one simple conversation over $1, as I showed before in the images
In your experience, is that the expected price? I gave Claude Haiku a try with the same amount of data, and the token price there was about $0.0000X; it's a big difference...
We are trying to figure out how to get GPT-3.5 prices while still answering well from the KB. Sadly it only works properly in 'Hybrid' mode, which brings us back to those prices. We have really built a well-explained (not too long) KB that has all the data we need answers about, so we can't figure it out
This is what we are trying to do 🫨
f
You can't really compare GPT-4 or GPT-3.5 to Haiku. They are different models, from different companies with different pricing. Haiku is incredibly cheap and fast. You could compare GPT-4 to their Opus model.
r
Will check that thank you!
f
You are very welcome
r
I am trying to do the table part with the FAQ in my KB
The thing with the tables is that it gets a little messy with translation and finding things
f
The tables are for structured data. It won't yield as good results if the data is not structured, but it will yield better results if the data is
r
We have made some changes, but it is still expensive when we use the 'Hybrid' KB model because of those Query Knowledge cards. Does anyone here have a recommendation for a way to reduce their cost somehow? We have been trying to figure this out for a long time
Is there a way, after using the Query Knowledge card once, to save the data somewhere so we don't have to use it every time we need the KB? That would make the whole process cheaper instead of using this expensive card each time. Every time a client asks a question we use the KB, but do we really need the Query Knowledge card each time? Or is there a way to use it only once, at the beginning, to scan the whole KB? https://cdn.discordapp.com/attachments/1222453583728087090/1225460833924943934/image.png?ex=66213656&is=660ec156&hm=7115883d5677b5ca216d07dd38ad39e5bffa51a76b1b8a4f2bd2efec1e0514f4&
The main reason for these prices is the Query Knowledge Base cards... but there is no other way to answer from the KB
f
I would recommend you to take a look at this video, which explains the basics of how RAG works

https://www.youtube.com/watch?v=2mDsuzty4fw

Hey there @rhythmic-lizard-47527 You can now reduce the spending as you wish in the KB agent settings. You can pick how many chunks it should retrieve from the Knowledge Bases. Fewer chunks = cheaper, but less accurate.
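To make the chunk tradeoff concrete, here is a rough back-of-the-envelope cost estimate. The function name, the tokens-per-chunk figure, and the per-1K-token price are illustrative assumptions, not Botpress's actual billing formula; check the OpenAI pricing page for current rates.

```javascript
// Rough, illustrative estimate of per-query KB cost.
// Assumption: each retrieved chunk adds roughly a fixed number of tokens
// to the prompt, and the model charges a price per 1K input tokens.
function estimateQueryCost(chunks, tokensPerChunk, pricePer1kTokens) {
  const promptTokens = chunks * tokensPerChunk;
  return (promptTokens / 1000) * pricePer1kTokens;
}

// With a hypothetical 500 tokens per chunk and a GPT-4-class input
// price of ~$0.03 per 1K tokens:
//   10 chunks -> ~$0.15 per query
//   3 chunks  -> ~$0.045 per query
```

This is why lowering the chunk slider cuts the knowledge-search cost roughly linearly: each chunk you drop removes its tokens from every query's prompt, at the price of the model seeing less context.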
r
How can we do that?
f
You can edit it in the Knowledge Agent settings. There is a slider
r
I want to marry you @fresh-fireman-491