Using the new 'Vision Agent' to analyse an image sent by a user on WhatsApp Botpress #🤝help

limited-library-71452

04/18/2024, 2:14 PM

Goal: Get an image sent by a user on WhatsApp described by the agent and saved as a variable.

The logs say I've got an error in the OpenAI part (I think):

issue id: iss_01HVRE0HSMVKFPRE0B14PQV7HW
message: An error occurred while executing an agent action: [object Object] (Status Code: 400)

I have left the image URL blank and created a variable for 'store answer in variable'. I then have an 'AI task' card to generate some advice based on the response from that variable. Vision agent __has__ proven to work when there is a URL in the 'image url' field of the 'Extract content from image' card.

https://cdn.discordapp.com/attachments/1230521763847209021/1230521763994140702/Screenshot_2024-04-18_at_15.09.04.png?ex=66339fb2&is=66212ab2&hm=4c38860d256fbf31b6c156e7b97b59633b5091aacd3a3f6e244265179acff62d&
https://cdn.discordapp.com/attachments/1230521763847209021/1230521764304654408/Screenshot_2024-04-18_at_15.10.48.png?ex=66339fb2&is=66212ab2&hm=d409629007db45077f71f793861ef91c4dc4c7645715c93b1bfc5abb945153ac&

fresh-fireman-491

04/18/2024, 2:24 PM

It can't run with the image url as blank

fresh-fireman-491

04/18/2024, 2:24 PM

It needs an image to analyse

limited-library-71452

04/18/2024, 2:27 PM

Sylvain mentioned: "If the incoming message is of type image, then the image will be extracted automatically. It also supports extracting from links". If this isn't the case, is there a way to somehow generate a link from the user's WhatsApp message/input?

jolly-policeman-82775

04/18/2024, 2:35 PM

Hey man

jolly-policeman-82775

04/18/2024, 2:35 PM

you need to save the url into a variable

jolly-policeman-82775

04/18/2024, 2:35 PM

and put it in the image url box

jolly-policeman-82775

04/18/2024, 2:35 PM

Example:
Bot: What image can I look at today?
User: example image.com
Bot: saves image URL into variable
Bot: provides info about that image

fresh-fireman-491

04/18/2024, 2:41 PM

https://botpress.com/docs/cloud/channels/whatsapp/#media-images-audio-documents

limited-library-71452

04/18/2024, 2:51 PM

Ok looks like the answer is in there, thank you. My no code status is about to get tested 😂 Thanks Decay

limited-library-71452

04/18/2024, 2:52 PM

I think you mean what Decay has sent, right? Creating a variable from a user's input?

limited-library-71452

04/18/2024, 3:04 PM

hmm getting closer. Pumped about this! I think it's sending to 'the gpt' now. But it "failed to extract content from images using gpt" https://cdn.discordapp.com/attachments/1230521763847209021/1230534291776208946/Screenshot_2024-04-18_at_16.02.12.png?ex=6633ab5d&is=6621365d&hm=645f564076ef6052e098a77d1a044d326bd359ca78c78a4da3f7de28a107113d&

fresh-fireman-491

04/18/2024, 3:07 PM

Try and console.log the link that you are extracting

fresh-fireman-491

04/18/2024, 3:07 PM

Might be the issue
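For readers unsure what logging the link involves, here is a minimal sketch of something that could go in an Execute Code card placed just before the 'Extract content from image' card. `describeExtractedLink` is a hypothetical helper name, and the sketch assumes the link is read from `event.payload.imageUrl`, as in the snippets elsewhere in this thread.

```javascript
// Hypothetical helper: builds a short, log-friendly description of whatever
// value is about to be passed to the card's image URL field.
function describeExtractedLink(value) {
  if (typeof value === 'string') {
    return `string: ${value}`
  }
  // Anything else (object, undefined, ...) is shown with its type so that a
  // wrapped value such as {"imageUrl": "..."} is easy to spot in the logs.
  return `${typeof value}: ${JSON.stringify(value)}`
}

// Inside an Execute Code card (where Botpress provides `event`), this might be:
// console.log(describeExtractedLink(event.payload.imageUrl))
```

If the log line starts with `object:` rather than `string:`, the card is probably being handed a wrapper object instead of a plain URL.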

jolly-policeman-82775

04/18/2024, 3:07 PM

Yeah

limited-library-71452

04/18/2024, 3:13 PM

Super sorry Decay, I have no idea what that means. Is that something to add here? I think I may be out of my depth a bit now 😔 https://cdn.discordapp.com/attachments/1230521763847209021/1230536751597097050/Screenshot_2024-04-18_at_16.13.02.png?ex=6633ada7&is=662138a7&hm=102c93617523568f4a97d4085662cb693a83ddab9af9f6176d1e2f37c11ae2ad&

fresh-fireman-491

04/18/2024, 3:14 PM

Try with just the URL
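A rough sketch of what "just the URL" could mean in code, assuming the value arrives either as a plain string or wrapped as `{"imageUrl": "..."}`. `toPlainUrl` is a hypothetical name, not a Botpress function. Note that this only normalises the value; it does not make a private WhatsApp media URL publicly readable.

```javascript
// Hypothetical helper: reduces a value to a bare URL string, or null if none is found.
function toPlainUrl(value) {
  let candidate = value
  if (typeof candidate === 'string') {
    const trimmed = candidate.trim()
    if (!trimmed.startsWith('{')) {
      // Already a bare string: return it unless it is empty.
      return trimmed.length > 0 ? trimmed : null
    }
    try {
      // Looks like JSON text such as {"imageUrl": "https://..."}.
      candidate = JSON.parse(trimmed)
    } catch (e) {
      return null
    }
  }
  if (candidate && typeof candidate.imageUrl === 'string') {
    return candidate.imageUrl
  }
  return null
}

// Possible use in an Execute Code card (variable name taken from this thread):
// workflow.testimage = toPlainUrl(event.payload)
```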

fresh-fireman-491

04/18/2024, 3:14 PM

Maybe @bumpy-addition-21507 or @limited-pencil-78283 could help here

bumpy-addition-21507

04/18/2024, 3:18 PM

Tbh, @fresh-fireman-491 is right on the money here. That should work

fresh-fireman-491

04/18/2024, 3:18 PM

Amazing! Thank you

limited-library-71452

04/18/2024, 3:23 PM

So just... https://lookaside.fbsbx.com/ or will I need the brackets too?? The original one was: {"imageUrl": "https://lookaside.fbsbx.com/...."}

fresh-fireman-491

04/18/2024, 3:23 PM

Neither should work

fresh-fireman-491

04/18/2024, 3:24 PM

You need the last part

fresh-fireman-491

04/18/2024, 3:24 PM

You can get it by this

```javascript
const whatsappAccessToken = env.WHATSAPP_ACCESS_TOKEN

const res = await axios.get(event.payload.imageUrl, {
  headers: {
    Authorization: `Bearer ${whatsappAccessToken}`,
  },
  // Ask axios for raw bytes; without this the binary body is decoded as text.
  responseType: 'arraybuffer',
})

// This will be a JavaScript Buffer (https://nodejs.org/api/buffer.html) containing the raw binary content of the media file.
const rawFileContent = res.data

// This will indicate the file type, see:
// https://developers.facebook.com/docs/whatsapp/cloud-api/reference/media/#supported-media-types
const mimeType = res.headers['content-type']
```

fresh-fireman-491

04/18/2024, 3:25 PM

https://developers.facebook.com/docs/whatsapp/cloud-api/reference/media/#download-media

limited-library-71452

04/18/2024, 3:33 PM

ok, created the Configuration Variable named "WHATSAPP_ACCESS_TOKEN" in the bot settings. Do I need to enter this code somewhere also? Sorry dude. I don't envy you having to deal with no-code warriors like me. I appreciate the pain this must cause

fresh-fireman-491

04/18/2024, 4:02 PM

No worries at all. I haven't really used WhatsApp that much, which is why I pinged BattleSynth and Lijo. I was hoping that one of them could help us here

limited-library-71452

04/18/2024, 4:32 PM

No worries. I guess there will be a way to change that url entry part to be whatever the user sends in. I doubt it would be made so that you have to have the same url in place for every user

limited-pencil-78283

04/18/2024, 4:49 PM

@fresh-fireman-491 - haven't read the thread yet, will come back in sometime

limited-pencil-78283

04/18/2024, 5:47 PM

Which card is this?

limited-library-71452

04/18/2024, 5:59 PM

Hey Lijo it's "Extract content from image" https://cdn.discordapp.com/attachments/1230521763847209021/1230578409697181766/Screenshot_2024-04-18_at_18.58.43.png?ex=6633d473&is=66215f73&hm=56c3ce350dceb2a11e27ca4d580cf610d0fe3ed227043c1afa74ac06f5538cf7&

limited-pencil-78283

04/18/2024, 6:00 PM

Ah I have never used it

limited-library-71452

04/18/2024, 6:06 PM

Yea I think it's pretty new. If I can get it to work it'll open a ton of opportunity. I don't think there is any other way for me to allow an AI to analyse the contents of an image which a WhatsApp user sends in. It looks like this card could do it. We are nearly there. I think it's an issue with the url part. Sylvain did say that can be blank earlier on though

jolly-policeman-82775

04/18/2024, 6:07 PM

What's wrong?

jolly-policeman-82775

04/18/2024, 6:10 PM

https://cdn.discordapp.com/attachments/1230521763847209021/1230581173252653116/image.png?ex=6633d706&is=66216206&hm=164bd92bd062c68f8c3d3c48ae85ab707884e350030c87ea105af2e0d20aa971&

fresh-fireman-491

04/18/2024, 6:13 PM

Hey there Theo 👋 It's with WhatsApp. You can read about it here in this post.

hundreds-battery-97158

04/19/2024, 7:44 AM

Hey there @limited-library-71452 . I have successfully integrated vision to my WhatsApp but I am using Claude instead of OpenAI. I used a thread from Decay in tutorials titled How to use Claude 3 in Botpress or something like that

hundreds-battery-97158

04/19/2024, 7:46 AM

are you able to extract the base64 string and file type from your url?

hundreds-battery-97158

04/19/2024, 8:04 AM

Here is a sample workflow and the code. You listen to input by using `event.type === 'image'` in the "user sends image" expression card, and if it is an image then you send it to the vision workflow, where you can extract the base64 string and the image type. Once you have those saved in variables, you can post them to the Vision API in the second card in the Vision node and display the response from the API in the text card. I am also no-code and it took me a month to force myself to understand what is happening here. You are on the right path and you will crack it, just continue trying. You will learn more in the process. https://cdn.discordapp.com/attachments/1230521763847209021/1230791169957691503/Screenshot_1709.png?ex=66349a99&is=66222599&hm=4438cedb8f3ee338bcd1d2ea22350f269d6413b526e79445092c1944a12867f7& https://cdn.discordapp.com/attachments/1230521763847209021/1230791170230325270/Screenshot_1710.png?ex=66349a99&is=66222599&hm=47cb69de837436356bda87725b0b77fac850c605474c6190e011b16c9e01a434&
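Since the code itself is only in the screenshots, here is a hedged sketch of the "extract the base64 string and the image type" step. It assumes the image bytes were downloaded with `responseType: 'arraybuffer'` and that a Node-style `Buffer` is available in the Execute Code sandbox. `toBase64Image` and the workflow variable names in the comments are hypothetical.

```javascript
// Hypothetical helper: turns a downloaded image response into the two values
// a vision API call typically needs, a base64 string and a MIME type.
function toBase64Image(data, headers) {
  // `data` may be a Buffer or an ArrayBuffer; Buffer.from accepts both.
  const base64 = Buffer.from(data).toString('base64')
  // Drop any parameters after the media type, e.g. "image/jpeg; charset=binary".
  const contentType = (headers && headers['content-type']) || ''
  const mimeType = contentType.split(';')[0].trim()
  return { base64, mimeType }
}

// Possible use in an Execute Code card (`axios`, `event`, `env`, `workflow`
// are provided by Botpress; the variable names are placeholders):
// const res = await axios.get(event.payload.imageUrl, {
//   headers: { Authorization: `Bearer ${env.WHATSAPP_ACCESS_TOKEN}` },
//   responseType: 'arraybuffer',
// })
// const { base64, mimeType } = toBase64Image(res.data, res.headers)
// workflow.imageBase64 = base64
// workflow.imageType = mimeType
```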

limited-library-71452

04/19/2024, 8:16 AM

Thanks so much for this info Takudzwa. Can’t wait to try this out. This will be huge if I can get it going 💪

limited-library-71452

04/19/2024, 7:01 PM

Appreciate the help already, but could I ask one more question? Can I confirm that you didn't actually need/use the 'extract content from image' card? You just sent the image the user sends in via WhatsApp direct to Claude/OpenAI via an Execute Code card instead?

limited-library-71452

04/19/2024, 7:01 PM

https://cdn.discordapp.com/attachments/1230521763847209021/1230956477028761681/Screenshot_2024-04-19_at_19.59.57.png?ex=6635348e&is=6622bf8e&hm=42894b052bda3695f46174d45c36c46aefc67eaf34af5ccc2dd8fc852b492b0c&

limited-library-71452

04/19/2024, 7:17 PM

I'm a little behind you in this part. You learned a lot in that month I think! 🧠

hundreds-battery-97158

04/19/2024, 7:23 PM

I think this card just got released recently and you are the first person I am seeing talking about this card.

hundreds-battery-97158

04/19/2024, 7:25 PM

Please see the code I shared above. Your image content will be in the imageUrl, so you can get your image content there (there will be no need for the extract content card).

limited-library-71452

04/19/2024, 7:27 PM

Ok understood. Thank you. Yea not sure i'm meant to have it tbh. It doesn't seem to do what I need as I need to generate the url on the fly as the user sends an image in. I'll experiment with the code. Thank you again.

limited-library-71452

04/22/2024, 2:32 PM

This coding stuff is hard, which is the reason I came to Botpress in the first place 😂 I have temporarily gone back to the 'easy' method of using this new "Extract content from image" card, and I keep getting this error. To save me from continuing down the learning-code-from-scratch route, has anyone successfully used this card before (without having a URL to use in the card's standard fields)? https://cdn.discordapp.com/attachments/1230521763847209021/1231975920525512724/Screenshot_2024-04-22_at_15.28.29.png?ex=6627c67c&is=662674fc&hm=ee48a0e6f19d94e930435f39ccd943534f5cd36774d3f59c9149abdbacd5190d&

hundreds-battery-97158

04/25/2024, 12:18 PM

hey bro how is it going now? Any success?

limited-library-71452

04/25/2024, 1:54 PM

Hey dude, not quite yet no. My coding knowledge is almost zero so I had to continue trying to use the "extract content from image card". The vision workflow always seems to fail without the url in the card though. I tried variables and tried to extract a url from the whatsapp image. Also now trying to use Zapier to do the image analysis, which it does well, and webhook from botpress to zapier and back to botpress for that part. Getting the image file/info/url from whatsapp is tricky still. I will maybe need to learn more/some code, seems like javascript, first

limited-library-71452

04/25/2024, 1:55 PM

I will not let this beat me though!

hundreds-battery-97158

04/26/2024, 7:37 PM

Seems like you are going through exactly what I went through. It literally took me a month to get a successful response, but I also learnt loads from the trial and error. Keep trying, you will crack it and will be better by the time you are done.

hundreds-battery-97158

04/26/2024, 7:38 PM

I will also try to test the "extract content from image" card this weekend and will let you know if I learn something from it

hundreds-battery-97158

04/26/2024, 7:40 PM

lets see a screenshot of your code? The one you are using to read the imageUrl

limited-library-71452

04/30/2024, 7:40 AM

Any luck? I can set up a variable for the URL and use the variable in the URL field of the card. If I prefill the variable, the AI does understand and does extract the content from the image. BUT I still can't get the variable filled with the URL of the incoming message image. I've been using ChatGPT to help with code and I actually think it's doing it in a different language. To find the WhatsApp image URL and then save it as the variable @workflow.testimage, it gave me this, which even I can tell is way off...

limited-library-71452

04/30/2024, 7:40 AM

https://cdn.discordapp.com/attachments/1230521763847209021/1234771387650936873/Screenshot_2024-04-30_at_08.39.12.png?ex=6631f1f7&is=6630a077&hm=78d2f47b6db1feb814e28b663a353cc2328f2601bf8f37cc9a22f9e44c960c31&

hundreds-battery-97158

04/30/2024, 9:51 AM

Hey bro, still haven't managed to check out the card. For code, can you try using Claude? I haven't been using ChatGPT in a while, but I believe you can do better with Claude 3. Explain your whole scenario to it, give it the code you currently have, and ask it to improve it.

hundreds-battery-97158

04/30/2024, 9:53 AM

Also the code I attached here you can copy it as it is and just activate the variables in your account and I am sure you can get some progress

limited-library-71452

05/02/2024, 10:01 AM

Dude, I've done some pretty extensive work with Claude and ChatGPT now. Still no luck, I'm afraid. They've iterated loads on two code blocks: one to extract the mediaID and the other to fetch the URL. They seem to think it's all about this error: "An error occured while running Execute Code card: temp is not defined". Don't suppose you (or anyone else) know of anyone that is making use of this "extract content from image" card yet, do you?

limited-library-71452

05/02/2024, 11:39 AM

I think i'm getting close using this code: https://cdn.discordapp.com/attachments/1230521763847209021/1235556378521632839/Screenshot_2024-05-02_at_12.39.12.png?ex=6634cd0c&is=66337b8c&hm=f9b35c86f12a8ed3bb969db49b34c4afca46d08fa472e8c85c8b6d54a563cc57&

limited-library-71452

05/02/2024, 11:41 AM

I think I'm having an issue with an 'access token' when I look at the log. It seems to be blank or undefined. https://cdn.discordapp.com/attachments/1230521763847209021/1235556773004443709/Screenshot_2024-05-02_at_12.41.16.png?ex=6634cd6a&is=66337bea&hm=9a49bb884f47376f3acddfc73a0642fd2e7ee524dd5899527a4ce1bffd7b4637&

limited-library-71452

05/02/2024, 11:43 AM

I do have the configuration variable set up in the settings though. Also entered it as a bot variable in the main editing panel. https://cdn.discordapp.com/attachments/1230521763847209021/1235557244196880414/Screenshot_2024-05-02_at_12.42.52.png?ex=6634cdda&is=66337c5a&hm=d45d5c781b245f1dcaa23a216940e7caf34320a6a789a858acbe07f22fb86a0a&
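One way to make a blank or undefined token fail loudly instead of silently is a small guard before the request. This is only a sketch: `buildMediaRequestConfig` is a hypothetical helper, and whether the token should be read from `env.WHATSAPP_ACCESS_TOKEN` or from a bot variable depends on how the bot is configured.

```javascript
// Hypothetical helper: builds the axios request options for downloading
// WhatsApp media, and throws early if the access token is missing.
function buildMediaRequestConfig(token) {
  if (typeof token !== 'string' || token.trim() === '') {
    throw new Error('WhatsApp access token is missing or empty')
  }
  return {
    headers: { Authorization: `Bearer ${token.trim()}` },
    // Request raw bytes so the image is not decoded as text.
    responseType: 'arraybuffer',
  }
}

// Possible use in an Execute Code card:
// const config = buildMediaRequestConfig(env.WHATSAPP_ACCESS_TOKEN)
// const res = await axios.get(event.payload.imageUrl, config)
```

With this in place, an empty token shows up in the logs as a clear error message rather than as a failed request.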

fresh-fireman-491

05/02/2024, 12:30 PM

Use bot.WHAT..... for the token

limited-library-71452

05/02/2024, 1:00 PM

Hey Decay, not sure what you mean my friend. Change the default value for the variable? https://cdn.discordapp.com/attachments/1230521763847209021/1235576634900217856/Screenshot_2024-05-02_at_13.59.27.png?ex=6634dfe9&is=66338e69&hm=19241707cdbfc6b78b20e64c74fec0078af25e68a273a0387aea6d76b0a36748&

limited-library-71452

05/02/2024, 1:08 PM

Ok it looks like the Access token is ok and we are generating a URL. It's just not saving to the variable required. Is this enough to save the URL as the variable @workflow.testimage for use in the "Extract content from image" card? Seems like there should be more

limited-library-71452

05/02/2024, 1:08 PM

https://cdn.discordapp.com/attachments/1230521763847209021/1235578663580340344/Screenshot_2024-05-02_at_14.06.07.png?ex=6634e1cd&is=6633904d&hm=ae335467d86b07913551a577d7ac4ba2588b31606859c650607652ceb3697e49&

limited-library-71452

05/02/2024, 1:09 PM

Console log shows the URL we need to be saved in that variable... https://cdn.discordapp.com/attachments/1230521763847209021/1235578907009355776/Screenshot_2024-05-02_at_14.09.13.png?ex=6634e207&is=66339087&hm=1f32442fbc5a92b9ed6282af70dce351f40abe5f066174cfe1a3d37bf7d2809b&
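As a hedged sketch of the missing step, assigning the logged URL to the workflow variable could look like the following. It assumes a workflow variable named `testimage` already exists (as in `@workflow.testimage` above) and that image messages carry the link in `event.payload.imageUrl`. `saveImageUrl` is a hypothetical helper.

```javascript
// Hypothetical helper: copies the incoming image URL into a workflow variable.
// `event` and `workflow` are passed in so the function can be tried outside Botpress.
function saveImageUrl(event, workflow, variableName) {
  const isImage = Boolean(event) && event.type === 'image'
  const url = isImage && event.payload ? event.payload.imageUrl : undefined
  if (typeof url !== 'string' || url.length === 0) {
    return false // nothing usable to save
  }
  workflow[variableName] = url
  return true
}

// Inside an Execute Code card this might be called as:
// saveImageUrl(event, workflow, 'testimage')
```

Saving the URL this way only fills the variable; whether the card can then read that URL is a separate question.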

limited-library-71452

05/02/2024, 1:18 PM

This is an example image URL from the incoming WhatsApp message: https://lookaside.fbsbx.com/whatsapp_business/attachments/?mid=1148705659495348&ext=1714655238&hash=ATuiSTp7GKQDuhLNbEqJH1XgenQMd3dNF9DGu2xKgT9OCQ Is this URL even able to be analysed via the "Extract content from image" card? Is it public? I think you mentioned before that it has to be a *public* URL

limited-library-71452

05/02/2024, 1:19 PM

inserting it into a browser leads to a Meta error message

hundreds-battery-97158

05/02/2024, 8:38 PM

Hey bro, great progress if you can get the URL. I think the next thing will be to extract the file type from the URL and also turn the data into a base64 string. Once you can extract those, you have all you need to make an API call to Claude

limited-library-71452

05/03/2024, 7:57 AM

Thanks. I really want to try and use the "extract content from image" card ideally. Feels like that would be simpler. Less that can go wrong. Although I can't actually get it to work yet 😂

hundreds-battery-97158

05/03/2024, 1:02 PM

You really should continue with the card and be successful so that you can post a tutorial for all of us to learn how to use the card

limited-library-71452

05/03/2024, 4:27 PM

😂 i'll keep trying. I think the issue is currently that the links created by a WhatsApp image message are not 'public'. The URLs for use in the "extract content from image" card need to be public apparently. If that is the case then my use case is impossible in Botpress.

fresh-fireman-491

05/03/2024, 5:42 PM

Hey there I am probably starting a new project soon where I will need a file from WhatsApp. I am saving this post so I can come back if I start the project and figure it out 🙂

limited-library-71452

05/04/2024, 4:33 PM

Awesome. Really appreciate it Decay. There will be an answer 🙌

fresh-fireman-491

05/08/2024, 3:56 PM

Could be

hundreds-battery-97158

05/10/2024, 4:00 PM

hey Michael

hundreds-battery-97158

05/10/2024, 4:00 PM

how has it been going? any progress

limited-library-71452

05/14/2024, 8:20 PM

Well, I thought it was over because of WhatsApp's end-to-end encryption, BUT the latest is that I used Postman to test whether a GET request could actually 'see' the image URL of an incoming WhatsApp image. And it DID when sent with the authorisation token. It shows up in the body of the response. So I just need to somehow get that GET response sent to an AI API, work out how to make it read the body of the response, and then find out how I can see the result. Then it might work. I also found out that those WhatsApp image URLs potentially only last for 5 minutes. I've randomly been banned from Claude whilst testing it, unfortunately. Not sure why.

clean-photographer-30160

05/15/2024, 1:39 PM

hi @limited-library-71452

clean-photographer-30160

05/15/2024, 1:39 PM

Any progress? I'm having the same problem as you

clean-photographer-30160

05/15/2024, 5:43 PM

hi can you share the remix of this flow?

clean-photographer-30160

05/21/2024, 5:39 PM

I found a way to do it, I will share it @limited-library-71452 @hundreds-battery-97158

clean-photographer-30160

05/21/2024, 5:48 PM

```javascript
try {
    const res = await axios.get(event.payload.imageUrl, {
        headers: {
            Authorization: `Bearer ${env.WHATSAPP_ACCESS_TOKEN}`,
        },
        responseType: 'arraybuffer'
    });
    const { data } = await axios.post(
        "https://r86otig4qf.execute-api.us-east-2.amazonaws.com/s3-image-uploader-test",
        res.data
    )

    workflow.dataFromImage = data.url

} catch (e) {
    workflow.dataFromImage = 'error :('
}
```

clean-photographer-30160

05/21/2024, 5:48 PM

here is the explanation

clean-photographer-30160

05/21/2024, 5:49 PM

This code performs the following steps:

Using axios to make a GET request to an image URL:

```javascript
const res = await axios.get(event.payload.imageUrl, {
    headers: {
        Authorization: `Bearer ${env.WHATSAPP_ACCESS_TOKEN}`,
    },
    responseType: 'arraybuffer'
});
```

- **event.payload.imageUrl**: The URL of the image you want to retrieve.
- **axios.get**: Makes a GET request to that URL.
- **headers**: Includes an authorization header with a WhatsApp access token.
- **responseType: 'arraybuffer'**: Specifies that the response should be treated as an arraybuffer, which is useful for handling binary data like images.

Using axios to make a POST request to an image upload service:

```javascript
const { data } = await axios.post(
    "https://r86otig4qf.execute-api.us-east-2.amazonaws.com/s3-image-uploader-test",
    res.data
)
```

- **Service URL**: "https://r86otig4qf.execute-api.us-east-2.amazonaws.com/s3-image-uploader-test"
- **res.data**: The body of the response from the GET request, which contains the image data.
- **axios.post**: Makes a POST request to the service URL with the image data.

Storing the URL of the uploaded image:

```javascript
workflow.dataFromImage = data.url
```

- **data.url**: Assigns the URL of the uploaded image (provided by the upload service) to workflow.dataFromImage.

Error handling:

```javascript
} catch (e) {
    workflow.dataFromImage = 'error :('
}
```

- **catch**: Catches any errors that occur during the execution of the GET or POST requests.
- **workflow.dataFromImage = 'error :('**: If an error occurs, it assigns the value 'error :(' to workflow.dataFromImage.

Summary: This code retrieves an image from a URL using an authorization token, then uploads that image to a specified service and stores the URL of the uploaded image in a variable. If any error occurs during the process, it catches the error and stores an error message instead of the URL.
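Because the error branch stores the text 'error :(' in the same variable that normally holds the URL, a later step may want to check the value before handing it to an image-analysis card. A minimal sketch, assuming the standard `URL` class is available in the Execute Code sandbox. `isUsableImageUrl` is a hypothetical helper.

```javascript
// Hypothetical helper: returns true only when the value parses as an http(s) URL.
function isUsableImageUrl(value) {
  if (typeof value !== 'string') return false
  try {
    const parsed = new URL(value)
    return parsed.protocol === 'https:' || parsed.protocol === 'http:'
  } catch (e) {
    // new URL() throws for strings that are not valid URLs, such as 'error :('.
    return false
  }
}

// Possible use after the upload step:
// if (!isUsableImageUrl(workflow.dataFromImage)) {
//   console.log('Image upload failed, skipping analysis')
// }
```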

clean-photographer-30160

05/21/2024, 5:51 PM

It works very well

limited-library-71452

05/24/2024, 8:44 AM

This is great thank you. I'll try it out asap. Can I ask how you are using it? Are you sending an image to an AI to analyze?

hundreds-battery-97158

05/24/2024, 11:07 AM

Congrats @clean-photographer-30160 there is always a way.

clean-photographer-30160

05/24/2024, 7:11 PM

Yes, I send the image and then I pass it to the AI

clean-photographer-30160

05/24/2024, 7:11 PM

https://cdn.discordapp.com/attachments/1230521763847209021/1243642598246191155/IMG_6309.png?ex=665237ed&is=6650e66d&hm=c71090fe8adcc78b1c78b25b7b199e0d533d2a12f62f77d8c5d03bbc23abd264&

clean-photographer-30160

05/24/2024, 7:13 PM

It is very important to save it on an external server

clever-horse-5574

05/26/2024, 4:14 PM

That's awesome @clean-photographer-30160 ! Can you maybe share the whole workflow you built?

busy-art-73803

07/10/2024, 6:14 AM

I see you are uploading it to an Amazon cloud address, do I need to open as well?

clean-musician-66640

07/29/2024, 1:53 PM

@clean-photographer-30160 how did you set up the API? I tried my own bucket, but it seems there is something I didn't set up properly. With your example it works fine. https://cdn.discordapp.com/attachments/1230521763847209021/1267480239253225595/IMG_6711.png?ex=66a8f075&is=66a79ef5&hm=fa6db1ae88618fd246b796f574ec2747c02a3beb7d80a26c53fdf0bbdb945a2e&

gentle-policeman-89141

10/12/2024, 12:08 PM

Hi @limited-library-71452 , did you manage to get this working? I’ve been struggling with it myself. Did you manually configure Botpress with WhatsApp, or did you use the wizard? When using the wizard, it hides the WhatsApp access token, so trying to make a GET request to the media URL always throws an error. Also, Meta has been quite a pain when it comes to linking already verified WhatsApp numbers to Meta apps and getting the correct token for API requests. Would appreciate any advice!
