How can we pass “instructions” to the TTS model (like gpt-4o-mini-tts) in Dify?

Kirtan_Bhad · November 8, 2025, 3:15pm

Question:

Hi everyone,

I’m using the Text-to-Speech (TTS) block in Dify with the gpt-4o-mini-tts model from OpenAI.
In the OpenAI API or Python SDK, we can include an additional field called instructions to control the tone, style, or mood of the generated audio. For example:

client.audio.speech.with_streaming_response.create(
    model="gpt-4o-mini-tts",
    voice="coral",
    input="Today is a wonderful day to build something people love!",
    instructions="Speak in a cheerful and positive tone."
)

However, in Dify’s TTS block settings, I don’t see any option to add instructions — only model, voice, and input text fields.

Is there a way in Dify to pass the same kind of instructions parameter (e.g., “Speak in a calm and professional tone”) to the TTS model?
Or do we need to use a workaround, like embedding the tone directly in the input text or using a custom HTTP block to call the OpenAI TTS endpoint?

Would really appreciate any guidance or examples from others who’ve implemented tone/style control in TTS within Dify workflows.

kurokobo · November 10, 2025, 1:26am

@Kirtan_Bhad
Unfortunately, this is currently a known limitation of the built-in TTS node.

As a workaround, you can use the Podcast Generator plugin, which supports the Instructions feature (I added this feature for this purpose ). Although this plugin is designed to generate conversation-style voices for two people, you can generate a single voice by passing in a one-line script and filling the Voice 2 parameters with dummy data.

Of course, you can also use the HTTP node to call the OpenAI API directly.

Hope this helps.

Topic		Replies	Views
Dify Tutorial \| Workflow, Build a Local, Open-Source Long-Text Translation Powerhouse with Dify \| Hands-on Tutorial Chinese 🇨🇳 course-beginner , ai	0	72	October 15, 2025
How to Build AI Chatbots & Chatflow Automation with Dify.ai English 🇬🇧 course-beginner , case	0	77	October 22, 2025
Build an AI Chatbot using Dify AI and Streamlit English 🇬🇧 course-beginner	0	56	October 22, 2025
Dify Tutorial \| [Getting Started with Dify] 3 ways to publish Dify apps Chinese 🇨🇳 ai , course-beginner	0	72	October 15, 2025
Deep Dive into Dify Template Conversion Node \| Enhance the Readability and Structure of AI Responses! Chinese 🇨🇳 ai , course-beginner	0	118	October 22, 2025
About the Feature Request category Feature Request	0	38	October 22, 2025
Create Your First AI Agent in Minutes with Dify.ai English 🇬🇧 ai , course-beginner	0	79	October 22, 2025
Dify AI: Create Apps/Software in Minutes With a Drag-and-Drop UI! English 🇬🇧 course-beginner , ai	0	39	October 16, 2025
Dify Quickstart Guide: Build Your First AI Workflow English 🇬🇧 course-beginner	2	179	October 23, 2025
Vibe coding ... Build Custom Tools for AI Agent with Cursor AI English 🇬🇧 course-advanced , case , ai	0	41	October 16, 2025

How can we pass “instructions” to the TTS model (like gpt-4o-mini-tts) in Dify?

Question:

Related topics