Does the Dify platform’s document extractor not support scanned PDFs?
I uploaded a scanned PDF, and after processing it with the file extractor, the output text was “”, so when passed to the large model, it became “No file content detected!”
Does the Dify platform’s document extractor not support scanned PDFs?
I uploaded a scanned PDF, and after processing it with the file extractor, the output text was “”, so when passed to the large model, it became “No file content detected!”
Nope, this node can only extract texts from the PDF files, if you want to extract things in the images, please use minerU or PaddleOCR instead.