
ChatGPT for YOUR OWN PDF files with LangChain

If you’re looking to harness the power of large language models for your data, this is the video for you. In this tutorial, you’ll …


46 thoughts on “ChatGPT for YOUR OWN PDF files with LangChain”

  1. Nice, I don't have the basic coding skills and I feel that's a must. I would like to challenge you, though, to create an app that can compare two or more documents and discover whether there is copy-and-paste or plagiarism between them, without running a search across the whole internet. Is this doable? [sketch below]

  2. I've written a prompt for GPT-4 that I use with ChatGPT in Macromancy formatting to transform it into a legal assistant, and the results have been stellar. Is it possible to encode this prompt into the system you describe so that the bot operates with it in mind? [sketch below]

  3. Amazing video with concise and clear explanations! Question: is there a way for me to use Azure and OneDrive to do this? I'm a noob and am not sure how, but your video makes me willing to try. My organization (healthcare) has mountains of PDFs full of gold we cannot mine. [sketch below]

  4. Wow, I stared at that opening graph for like 10 minutes, in awe, realizing the implications and uses and marveling at the elegance. This is insanely similar to an approach I thought of for extracting new information during conversation, but this is more elegant.
    I should start making graphs of my approaches, since they tend to get pretty complex and sometimes I lose track of what I'm doing or trying to do.

  5. Can this be done without relying on OpenAI? There are quite a few open models now; I wonder if this kind of workflow would be possible using GPT4All and similar models locally. [sketch below]

  6. Nice video! I assume that DeepL uses a similar approach to translate PDFs. I used it but encountered some problems. For example, if a sentence does not end on one page, it can return nonsense. Perhaps that is the reason for the "overlap" you mention? So, I rewrote some 250-page documents to eliminate sentences that run across page breaks. (From now on, I will compare translating a text to making queries, since both require a comparable amount of "work" from GPT.) This helped a lot, but not always.

    In my opinion, the reason for the occasional issues is that it is difficult to predict the number of tokens required for each page. If the text is complex scientific or technical content, as in my case, GPT needs more tokens for the same number of characters than it would for, say, a fairy tale. So with a technical or scientific document you can run out of tokens very quickly. Whether it's translating or making queries, I believe this problem will arise. [sketch below]

    Perhaps we need to wait for GPT's maximum token count to grow by 2-3x before it can handle any kind of text. For now, you could reduce the page format so that each page carries less (con)text.

  7. Is there any way to save the generated embeddings to a file, so that later I can load them from disk and avoid regenerating them again and again? If possible, can you please give me a sample? [sketch below]

  8. What is the approximate API cost if I use a 1000-page university textbook? I mean the cost of embedding the PDF data as well as the search cost for questions. Can you give the cost in terms of API pricing or tokens? [sketch below]

  9. Very informative video. How would you recommend summarising a long PDF? Do you think recursive summarisation, or semantic clustering, sampling from the clusters, and then summarising the clusters, would be best? [sketch below]

  10. At step 25, I just received the following error message "embeddings.openai.embed_with_retry.<locals>._embed_with_retry in 4.0 seconds as it raised RateLimitError: You exceeded your current quota, please check your plan and billing details.." – can you explain what I need to do to prevent this error message please?

  11. Thanks! This was super helpful and I was able to query my own PDFs, but I can't figure out where and how to specify GPT-4 as the LLM. Can you please let me know? [sketch below]

  12. Fantastic video. I'm sure someone has made a follow-up somewhere, but can you help me understand how to wrap everything into my own UI, where I can pass a parameter through to the search query so it can effectively act as a chatbot? [sketch below]

  13. Hi, very good video. My question: what maximum size is permitted? Can we upload 2-10 GB of files, or even more, with this procedure? Or let me know if we would have to develop another type of architecture. Best regards.

  14. Great stuff. With the help of Bard adjusting your code, I was able to load a PDF from my local desktop rather than Google Drive. How do I go about reading a whole folder of files at once? [sketch below]

  15. That was a great video, thanks!
    But in the end, how do you get the ChatGPT answer out of LangChain and into your own apps? [sketch below]

  16. I used a research paper as the input PDF and want it to create a 1500-word summary, but it cuts off at 200-something words. Also, at the point where you specify the model (8:59), I can't input any models. [sketch below]

  17. Thank you for this awesome content! I have a query: I am trying to ingest a huge PDF, say 1000 pages, and it fails during ingestion. I am using Azure OpenAI for this. Can you please share some thoughts on how a huge PDF can be handled in this scenario? [sketch below]
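Sketches for the technical questions above follow. All of them are hedged guesses, not the video author's code; they reuse names from the video's notebook (docsearch, chain, docs, texts, embeddings) and the classic LangChain APIs, so treat names and parameters as assumptions to adapt.

For comment 1: comparing two documents for copied passages doesn't need an internet search; chunk both, embed the chunks, and flag near-identical pairs. A minimal sketch (file names and the 0.95 cutoff are placeholders):

```python
import numpy as np
from langchain.embeddings.openai import OpenAIEmbeddings
from langchain.text_splitter import CharacterTextSplitter

# Split both documents (extracted text) into comparable chunks.
splitter = CharacterTextSplitter(chunk_size=500, chunk_overlap=0)
chunks_a = splitter.split_text(open("doc_a.txt").read())
chunks_b = splitter.split_text(open("doc_b.txt").read())

# Embed and L2-normalise so dot products are cosine similarities.
emb = OpenAIEmbeddings()
vecs_a = np.array(emb.embed_documents(chunks_a))
vecs_b = np.array(emb.embed_documents(chunks_b))
vecs_a /= np.linalg.norm(vecs_a, axis=1, keepdims=True)
vecs_b /= np.linalg.norm(vecs_b, axis=1, keepdims=True)

# Report chunk pairs that are suspiciously similar.
sims = vecs_a @ vecs_b.T
for i, j in zip(*np.where(sims > 0.95)):  # threshold is a guess; tune it
    print(f"doc_a chunk {i} ~ doc_b chunk {j} (cos={sims[i, j]:.3f})")
```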
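For comment 2: yes, in classic LangChain you can bake a custom persona into the chain by passing your own prompt template; the template text below is a placeholder for the commenter's own prompt.

```python
from langchain.chat_models import ChatOpenAI
from langchain.chains import RetrievalQA
from langchain.prompts import PromptTemplate

# Placeholder persona; substitute your own legal-assistant prompt here.
template = """You are a meticulous legal assistant.
Use the following pieces of context to answer the question.

{context}

Question: {question}
Answer:"""
prompt = PromptTemplate(template=template, input_variables=["context", "question"])

qa = RetrievalQA.from_chain_type(
    llm=ChatOpenAI(temperature=0),
    chain_type="stuff",  # stuff all retrieved chunks into one prompt
    retriever=docsearch.as_retriever(),  # docsearch = vector store from the video
    chain_type_kwargs={"prompt": prompt},
)
print(qa.run("What does clause 4.2 require?"))
```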
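For comment 3: pointing the same code at Azure OpenAI is mostly configuration; the resource and deployment names below are placeholders, and OneDrive files would need to be downloaded or synced locally first.

```python
import os

# Placeholder Azure settings; fill in your own resource, key, and deployments.
os.environ["OPENAI_API_TYPE"] = "azure"
os.environ["OPENAI_API_BASE"] = "https://<your-resource>.openai.azure.com/"
os.environ["OPENAI_API_VERSION"] = "2023-05-15"
os.environ["OPENAI_API_KEY"] = "<your-azure-key>"

from langchain.llms import AzureOpenAI
from langchain.embeddings.openai import OpenAIEmbeddings

llm = AzureOpenAI(deployment_name="<your-llm-deployment>")
# Older Azure API versions embed one text per request, hence chunk_size=1.
embeddings = OpenAIEmbeddings(deployment="<your-embedding-deployment>", chunk_size=1)
```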
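For comment 5: in principle yes; classic LangChain ships wrappers for local models, so a fully offline variant might look like this (the model file name and embedding model are assumptions):

```python
from langchain.llms import GPT4All
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.vectorstores import FAISS
from langchain.chains import RetrievalQA

# Local sentence-transformers embeddings instead of the OpenAI endpoint.
embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
db = FAISS.from_texts(texts, embeddings)  # texts = your PDF chunks

# Path to a GPT4All model you have downloaded locally.
llm = GPT4All(model="./ggml-gpt4all-j-v1.3-groovy.bin")
qa = RetrievalQA.from_chain_type(llm=llm, retriever=db.as_retriever())
print(qa.run("Summarise section 2."))
```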
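For comment 6: you can take the guesswork out of tokens-per-page by counting them up front with tiktoken (the 1500-token threshold below is arbitrary):

```python
import tiktoken

enc = tiktoken.encoding_for_model("gpt-3.5-turbo")
for i, chunk in enumerate(chunks):  # chunks = your split pages
    n = len(enc.encode(chunk))
    if n > 1500:  # arbitrary warning threshold
        print(f"chunk {i}: {n} tokens - consider splitting further")
```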
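For comment 7: yes; with the FAISS store used in the video you can save the index to disk once and reload it on later runs:

```python
from langchain.embeddings.openai import OpenAIEmbeddings
from langchain.vectorstores import FAISS

embeddings = OpenAIEmbeddings()

# First run: embed once and save.
db = FAISS.from_texts(texts, embeddings)  # texts = your PDF chunks
db.save_local("faiss_index")

# Later runs: load from disk instead of re-embedding.
db = FAISS.load_local("faiss_index", embeddings)
docs = db.similarity_search("my question")
```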
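For comment 8: a back-of-envelope estimate; prices change, so both the tokens-per-page figure and the ada-002 rate below are assumptions to re-check against current pricing.

```python
pages = 1000
tokens_per_page = 500        # rough average for a dense textbook page (assumed)
embed_price_per_1k = 0.0001  # USD per 1K tokens, text-embedding-ada-002 (assumed)

embed_cost = pages * tokens_per_page / 1000 * embed_price_per_1k
print(f"one-off embedding cost: ~${embed_cost:.2f}")  # about $0.05 at these rates

# Each question then embeds the query (negligible) and sends a few thousand
# tokens of retrieved context to the chat model, i.e. fractions of a cent
# per question at gpt-3.5-turbo rates.
```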
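For comment 9: recursive (map-reduce) summarisation is built into LangChain, which makes it the easy baseline to try before the cluster-and-sample route; clustering would sit on top of the embeddings you already have.

```python
from langchain.chat_models import ChatOpenAI
from langchain.chains.summarize import load_summarize_chain

# map_reduce: summarise each chunk, then summarise the summaries.
chain = load_summarize_chain(ChatOpenAI(temperature=0), chain_type="map_reduce")
summary = chain.run(docs)  # docs = the chunked Document objects from the loader
print(summary)
```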
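For comments 11 and 16: both the model and the output length are set on the LLM object you pass into the chain; GPT-4 also has to be enabled on your API key.

```python
from langchain.chat_models import ChatOpenAI
from langchain.chains.question_answering import load_qa_chain

llm = ChatOpenAI(
    model_name="gpt-4",  # instead of the default gpt-3.5-turbo
    temperature=0,
    max_tokens=2000,     # raise this if long summaries cut off early (comment 16)
)
chain = load_qa_chain(llm, chain_type="stuff")
answer = chain.run(input_documents=docs, question="Write a 1500-word summary.")
```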
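For comments 12 and 15: the chain just returns a Python string, so any UI can call it; a minimal Gradio wrapper (assuming the docsearch and chain objects from the video) might look like this.

```python
import gradio as gr

def ask(question):
    # Retrieve relevant chunks, then run the QA chain over them.
    docs = docsearch.similarity_search(question)
    return chain.run(input_documents=docs, question=question)

gr.Interface(fn=ask, inputs="text", outputs="text",
             title="Chat with my PDF").launch()
```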
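For comment 14: DirectoryLoader reads a whole folder in one go (the path, glob pattern, and loader class here are assumptions):

```python
from langchain.document_loaders import DirectoryLoader, PyPDFLoader

# Load every PDF under ./pdfs, one Document per page.
loader = DirectoryLoader("./pdfs", glob="**/*.pdf", loader_cls=PyPDFLoader)
docs = loader.load()
print(len(docs), "pages loaded")
```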
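For comment 17: one common failure with 1000-page PDFs is pushing every chunk to the embeddings endpoint at once; ingesting in batches (the batch size and pause are guesses aimed at rate limits) is a reasonable first fix.

```python
import time
from langchain.vectorstores import FAISS

# docs/embeddings = the chunked Documents and embedder from earlier steps.
batch = 100
db = FAISS.from_documents(docs[:batch], embeddings)
for i in range(batch, len(docs), batch):
    db.add_documents(docs[i:i + batch])
    time.sleep(1)  # crude cushion against rate limits
db.save_local("big_pdf_index")
```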

Comments are closed.