Research Demos

Interactive demos for prosody control techniques. Each demo lets you try a different approach to emotion and prosody manipulation.

Current Focus: S2 - Temporal Keyframe Control

These demos explore different ways to control prosody at word-level. Goal: "word 3 = angry, word 7 = calm" with smooth transitions.

Coming soon

Emo-FiLM Keyframes

Word-level emotion control with FiLM modulation

  • Per-word emotion
  • Timeline editor
  • Smooth transitions
Coming soon

DrawSpeech

Sketch pitch and energy curves directly

  • Draw pitch contour
  • Draw energy
  • Canvas editor
Coming soon

Chatterbox

Single-parameter emotion exaggeration (0=monotone, 2=dramatic)

  • One slider
  • Paralinguistic tags
  • Quick testing
Coming soon

MaskGCT Prosody

Masked generative codec for prosody editing

  • Prosody masking
  • Selective regeneration
  • Fine control

Demo pages are auto-generated by research agents using Codex.

See .claude/commands/create-demo-page.md for the template.