Dify workflow patterns — 10 we actually use in production

Ten Dify workflow patterns that survive production — KB fallback, context compression, concurrent tool calls, retries and more.

Pattern 1 — KB fallback#

Problem: top-1 retrieval below threshold shouldn’t be answered confidently.

Do: “Retrieval → conditional branch” — if score < 0.5, reply “No relevant info, handing off.”

Problem: long history blows up tokens.

Do: “Get context → LLM summarize → replace history with summary.” Compression ratio typically 5-10×.

Problem: pre-sales / post-sales / complaints in one bot performs poorly.

Do: front-load an LLM classifier (enum output), branch on it.

Problem: looking up order + shipping + coupons serially is slow.

Do: “Iterate” or “parallel” nodes — fire three HTTP calls, merge results.

Problem: HTTP tools occasionally 500.

Do: HTTP node “3 retries with exponential backoff” + fallback branch returns a degraded answer.

Problem: promo day floods downstream ERP.

Do: Add a “rate limit” node at the start keyed by user_id; over N/min → queue.

Problem: user asks refund amount; LLM cannot compute it.

Do:

1. HTTP(order API) → order_amount
2. LLM → answer using `{{order_amount}}`
3. Prompt: "Use `{{order_amount}}` exactly; do not rewrite."

Problem: AI can’t answer — don’t dead-end.

Do: failure branch calls Chatwoot API to open a ticket with the full context as a note.

Problem: collecting name / email / description rarely happens in one turn.

Do: Use Conversation App, check completeness per turn, ask for whatever is missing.

Problem: was the new prompt actually better?

Do: random branch — 50% old prompt, 50% new — tag user_id, compare CSAT later.

Keep workflow nodes atomic — easier to debug individually
Declare key variables in the Start node, not scattered
Use different temperatures for different LLM nodes (generation vs classification)
Use Dify’s debug pane to step through — faster than full logs