A file and a question were uploaded simultaneously in the workflow, but the LLM mentioned during reasoning that no uploaded documents were received.

Dify_Forum_Helper · January 20, 2026, 11:32pm

This is a common problem. Based on similar discussions in the community (especially the user’s own feedback in another post, /t/topic/683), the main cause is that the standard Document Extractor (Doc Extractor) node does not perform OCR on scanned (image-only) PDFs by default.

Here is an analysis of the cause and a proposed solution:

Core Reason Analysis

Scanned PDFs have no text layer: The user uploaded a scanned document, which is essentially a set of page images wrapped in a PDF.
Extractor limitations: Dify’s built-in “Document Extractor” node typically uses tools like pypdfium2 when processing PDFs. These tools can only extract selectable text from a file and cannot perform OCR on images.
Result: The text variable output by the extractor is an empty string.
LLM’s response: Because the variable passed into the prompt is empty, the LLM really did see no content, so its answer (“No uploaded documents received”) is accurate.
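To confirm this diagnosis outside of Dify, you could check whether the PDF has any extractable text at all. The following is a minimal sketch, not Dify's actual implementation: it assumes pypdfium2 (version 4 API) is installed locally, and the function names and the 20-character threshold are hypothetical choices made for illustration.

```python
from typing import Iterable, List


def has_text_layer(page_texts: Iterable[str], min_chars: int = 20) -> bool:
    """Return True if the pages together hold at least `min_chars`
    non-whitespace characters. The threshold is an illustrative assumption."""
    total = sum(len("".join(text.split())) for text in page_texts)
    return total >= min_chars


def extract_page_texts(pdf_path: str) -> List[str]:
    """Read the selectable text of each page with pypdfium2.
    No OCR happens here, so image-only pages yield empty strings."""
    import pypdfium2 as pdfium  # third-party dependency, imported lazily

    pdf = pdfium.PdfDocument(pdf_path)
    texts = []
    for index in range(len(pdf)):
        textpage = pdf[index].get_textpage()
        texts.append(textpage.get_text_range())
    return texts
```

If `has_text_layer(extract_page_texts("scan.pdf"))` returns `False` for the uploaded file (the file name here is only a placeholder), the document is most likely image-only and needs OCR before an LLM can read it.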

Example Reply to the User

You can reply to the user as follows:

The cause of this problem has been identified: Dify’s built-in “Document Extractor” node does not perform OCR on scanned (image-only) PDFs by default. As a result, the extracted text is empty, and the LLM did not receive any text content.

Solution:
Please go to the Plugin Marketplace to find an OCR-enabled plugin to replace the original Document Extractor node.

We recommend trying the Unstructured plugin (it requires configuring the corresponding service or API).

Alternatively, search for other OCR plugins.

You already touched on the core reason in another post (Topic 683): locally deployed ETL configurations are primarily used for knowledge bases and do not apply to the built-in nodes of a workflow. To process scanned documents in a workflow, you must explicitly use an OCR-enabled tool node.
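To make the empty-extraction case visible instead of letting it reach the LLM silently, a small guard step could sit between the extractor and the LLM. This is only a sketch, assuming the workflow's Code node follows the Python convention of a `main` function that returns a dict; the output names `needs_ocr` and `text` and the `MIN_CHARS` threshold are hypothetical.

```python
def main(text: str) -> dict:
    # Minimum number of characters treated as "real" extracted text.
    # This threshold is an illustrative assumption, not a Dify default.
    MIN_CHARS = 20

    # Treat a missing value the same way as an empty string.
    stripped = (text or "").strip()
    needs_ocr = len(stripped) < MIN_CHARS

    # The flag is returned as a string so that a downstream condition
    # can compare it directly with "true" or "false".
    return {
        "needs_ocr": "true" if needs_ocr else "false",
        "text": stripped,
    }
```

A condition branch on `needs_ocr` could then send the file to an OCR-enabled tool node when the value is "true" and pass `text` to the LLM otherwise.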

Related Documentation and Discussions:

Unstructured Plugin - Dify Marketplace
Related community discussion: /t/dify/683 (another related question from the same user)

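Topic	Category and tags	Replies	Views	Activity
Does a local Dify deployment not include a document parsing engine by default?	Discussion	11	455	January 24, 2026
Can the Markdown converter not obtain the file's storage location? Is the output variable unavailable?	Discussion	6	501	January 23, 2026
Document extractor tool node stays stuck in the running state	Discussion; case	7	296	January 26, 2026
Paper-Essence: tutorial for building a paper-highlights push workflow	Chinese 🇨🇳; case	0	664	February 9, 2026
Does the latest version of Dify still not support video upload?	Discussion	15	561	January 27, 2026
Is there a Dify workflow or plugin that can convert Word (doc and docx) to PDF, or parse Word (doc and docx) directly?	Discussion	5	364	January 22, 2026
In a knowledge base chatflow, knowledge retrieval gives no feedback during testing and running and returns empty results directly	Discussion; case	7	331	January 22, 2026
Dify+webhook+poll+error	Discussion	6	234	January 26, 2026
In the “Code Execution” node, input_file, rules_file, and stock_file cannot be associated individually, and this is suspected to be related to userinput.files. Requesting the official method for correctly mapping variables or handling legacy fields.	Discussion; commuity, case	7	164	January 26, 2026
The dify process executed without any errors, but the API did not return a value	Discussion	15	652	April 16, 2026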
