Hello everyone,
Iām developing a real estate chatbot using Botpress and Iām facing a challenge with the knowledge baseās handling of web page content when itās segmented into chunks.
To illustrate, letās say I have a web page for āResidence Aā, listing available apartments in that residence which are from apartment 1 to apartment 10.
This page is segmented into chunks when scraped into the knowledge base using āwebsiteā. The first chunk includes the residenceās name āResidence Aā along with āApartment 1ā to āApartment 5.ā And, the second chunk starts with āApartment 6ā and goes to āApartment 10ā but does not repeat the āResidence Aā name since itās already mentioned in the first chunk.
The issue is that when I query the bot about the available apartments in āResidence A,ā it only gives me from Apartments 1 to 5 (the ones in the first chunk) and doesnāt give the second chunkās apartments (Apartment 6 to 10) since they lack the residence name āResidence Aā in their chunk, so the bot doesnāt really know to which residence they belong since it only sees their names, and therefore doesnāt include them with the available apartments for residence A.
Does each chunk really gets treated independently ? If so, how can we ensure that all relevant data in subsequent chunks is still correctly associated with āResidence Aā? Any suggestions on configuring the knowledge base or the scraping mechanism to maintain linkage between related chunks would be very helpful.
Thank you for any insights or advice!