How to adapt custom chunked data output from code nodes to the parent_child_structure (multimodal) validation of knowledge base nodes?

loficore · January 12, 2026, 6:58am

Problem Background

I am developing a custom code parsing tool as a Dify workflow, specifically targeting the Zig language. To get more precise RAG results, I have disabled the system’s automatic chunking and instead implemented “parent-child chunking” logic manually within a code node (Node.js/Python):

Parent Chunk: A complete function implementation or type definition.
Child Chunk: Fine-grained semantic units derived from the parent chunk (e.g., comments, function signatures).
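
To make the parent/child split above concrete, here is a minimal Python sketch of how one parent chunk could be broken into child chunks. It is only an illustration, not my actual parser: the function name split_parent is hypothetical, and it assumes child chunks are just Zig comment lines plus the function signature up to the opening brace.

```python
def split_parent(parent_content: str) -> dict:
    """Derive child chunks (comment lines and the signature) from one parent chunk.

    Hypothetical helper: the line-based matching is a simplification and
    does not handle multi-line signatures or type definitions.
    """
    children = []
    for line in parent_content.splitlines():
        stripped = line.strip()
        if stripped.startswith("//"):
            # Zig line comments and doc comments ("//", "///") become children.
            children.append(stripped)
        elif stripped.startswith(("pub fn ", "fn ")):
            # Keep only the signature, i.e. everything before the opening brace.
            children.append(stripped.split("{", 1)[0].strip())
    return {"parent_content": parent_content, "child_contents": children}


if __name__ == "__main__":
    zig_source = "/// Entry point.\npub fn main() void {\n    return;\n}"
    print(split_parent(zig_source))
```

Each returned dict uses the same parent_content / child_contents field names as the attachment at the end of this post.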

Core Pain Point

The biggest obstacle I’m facing: the JSON object output by the code node is either not recognized by the knowledge base node at all, or the node reports the error “Output parent_child_structure is missing”. Although I’ve tried mimicking the output format of tool nodes, there is no official documentation defining the schema for the (multimodal) parent_child_structure type, so variable mapping fails frequently.

Actions Taken

Data Structure Restructuring: I’ve tried returning a plain array, as well as an Object containing parent_mode and parent_child_chunks.
Output Variable Definition: In the code node’s “Output Variables,” I manually declared result as type Object, but the variable selector in the downstream knowledge base node still fails to correctly parse its internal sub-properties.
Environment Check: Confirmed that the Embedding model is functioning normally, and child_contents are all non-empty string arrays.
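
For reference, the sanity check I run on the payload before handing it downstream looks roughly like the sketch below. It only verifies the fields I already know about (parent_mode, parent_child_chunks, parent_content, child_contents); it is not the official schema, which is exactly what I am missing. The function name find_problems is my own.

```python
def find_problems(payload: dict) -> list:
    """Return a list of human-readable problems; an empty list means the
    payload matches the shape I am currently assuming (not an official schema)."""
    structure = payload.get("parent_child_structure")
    if not isinstance(structure, dict):
        return ["parent_child_structure is missing or not an object"]

    problems = []
    if not isinstance(structure.get("parent_mode"), str):
        problems.append("parent_mode is not a string")

    chunks = structure.get("parent_child_chunks")
    if not isinstance(chunks, list) or not chunks:
        problems.append("parent_child_chunks is not a non-empty list")
        return problems

    for i, chunk in enumerate(chunks):
        if not isinstance(chunk, dict):
            problems.append(f"chunk {i} is not an object")
            continue
        parent = chunk.get("parent_content")
        if not isinstance(parent, str) or not parent.strip():
            problems.append(f"chunk {i}: parent_content is not a non-empty string")
        children = chunk.get("child_contents")
        if (
            not isinstance(children, list)
            or not children
            or not all(isinstance(c, str) and c.strip() for c in children)
        ):
            problems.append(f"chunk {i}: child_contents is not a non-empty list of non-empty strings")
    return problems
```

My current output passes this check with no problems reported, so the failure seems to be in how the knowledge base node recognizes the variable rather than in empty or malformed content.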

Questions for Guidance

Official Schema Definition: What is the complete JSON Schema for the strongly typed variable parent_child_structure? Besides parent_mode and parent_child_chunks, are there hidden metadata fields or specific $schema identifier requirements?
Variable Recognition Logic: Why is the Object output by the code node often filtered out (not displayed) in the knowledge base node’s variable selector? Is there a specific variable naming convention or “Output Variable” declaration method that must be followed?
Best Practices for Manual Chunking: If I want to bypass Dify’s default cleaning logic and directly store preprocessed parent-child chunks into the knowledge base, aside from the “code node → knowledge base node” path, is there a more mature API or plugin approach?

Attachment: Current Output Format Reference

{
  "parent_child_structure": {
    "parent_mode": "paragraph",
    "parent_child_chunks": [
      {
        "parent_content": "pub fn main() void { ... }",
        "child_contents": ["pub fn main()", "void { ... }"]
      }
    ]
  }
}
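
For completeness, a simplified sketch of the Python code node that produces the output above. It assumes the usual code node convention of a main function returning a dict keyed by the declared output variable names; the input name parent_chunks is hypothetical, and whether the knowledge base node accepts this shape is precisely my open question.

```python
def main(parent_chunks: list) -> dict:
    """Wrap pre-split chunks in the structure shown in the attachment.

    parent_chunks is a hypothetical input: a list of dicts, each with
    "parent_content" (str) and "child_contents" (list of str).
    """
    return {
        "parent_child_structure": {
            # "paragraph" is the value I currently send; other accepted
            # values, if any, are unknown to me.
            "parent_mode": "paragraph",
            "parent_child_chunks": [
                {
                    "parent_content": chunk["parent_content"],
                    "child_contents": list(chunk["child_contents"]),
                }
                for chunk in parent_chunks
            ],
        }
    }
```

In the node’s “Output Variables” I declare parent_child_structure (or result, in earlier attempts) as type Object.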

I would greatly appreciate guidance from official documentation or experienced users. Thank you very much!
