https://discord.gg/botpress logo
#🌎general
Title
# 🌎general
n

narrow-author-22836

06/06/2023, 6:55 PM
And the pdf is 400+ pages big
a

acoustic-hair-60678

06/06/2023, 7:16 PM
Arthi, since it looks like it'll be impossible to use an offline solution, maybe you can use your chatbot as a study tool? Asking it questions that you anticipate will be covered tomorrow can help you get to the key insights from the 400+ page document. Just brainstorming here, but hope that can help!
n

narrow-author-22836

06/06/2023, 7:56 PM
Thats a good call and idea. Thank you ^^
a

acoustic-hair-60678

06/06/2023, 8:23 PM
Best of luck on the exam. You got this!
d

damp-jackal-59222

06/06/2023, 9:21 PM
Use an online llm to generate questions, then use a similar online llm to generate answers. Then train a simple embedding models that runs easily on cpu/locally (Maybe word2vec can work) against the Q&As. Use some local vectordb to index Q&As. Using the same model locally you should have a decent semantic search system to retrieve questions and answers. You can use something like pypdf and tiktoken to chunk the large pdf into usable chunks.
9 Views