This simulates a support assistant where the retrieval step is expensive but deterministic, while the final answer still recomputes. Submit the same question twice, or change only the tone, and watch Gradio show the used cache badge when the cached helper is reused.