Audio Generator
Create dialogue, SFX, and voice variants with a multi-modality audio workstation.
Workspace Map
- Top Header: editor title, panel toggle, generation status, and close.
- Left Sidebar (collapsible + resizable): modality panels, metadata panel, and performance text panel.
- Main Workspace: audio selection/clear controls plus full player and trim controls.
- Bottom Generations Panel (collapsible + resizable): previous generated outputs for quick A/B selection.
Panel-by-Panel Reference
Dialogue Panel (Text-to-Speech)
Core dialogue generation panel with character selector, prompt text, model picker, dynamic model options, and output format. Character-specific voice settings can be loaded/saved against cast entries.
SFX Panel
Focused sound-effect generation with prompt, model picker, dynamic settings, duration slider, and output format. Best for effects/foley layers.
Voice Design Panel
Builds reusable voice profiles using character name, voice characteristics, and sample line fields. Includes model picker and advanced options for voice identity shaping.
Voice-to-Voice Panel
Transforms source audio into a target voice. Supports file upload, drag-drop, asset browser selection, source preview player, target voice picker, and model-specific options.
Audio Metadata Panel
Sidebar metadata context for selected audio and timeline points (duration/current time/in-out).
Performance Text Panel
Displays performance text context tied to the current asset so generation choices stay aligned to script intent.
Workspace Header
Main-center controls to select project audio into workspace and clear the current canvas audio.
Audio Player Panel
Playback and edit controls: play/pause, seek, volume, speed, reverse, in/out points, apply trim, and reset trim.
Previous Generations Panel
Bottom history strip of generated outputs with quick selection. Collapsible and vertically resizable for compare-heavy review passes.
Panel Persistence Behavior
Each modality keeps its own configuration snapshot, so switching panels preserves previous settings instead of resetting them.
Main Workspace Capabilities
- Select existing project audio into the workspace header.
- Play/pause, seek, volume, playback speed, and reverse playback controls.
- Set in/out points and apply trim or reset trim.
- Review metadata and attached performance text context during iteration.
Step-by-Step Workflow
- Open Audio Generator in the target project.
- Choose a modality panel (Dialogue, SFX, Voice Design, or Voice-to-Voice).
- Set model + prompt/options in that panel.
- Generate and monitor status in header/task indicators.
- Preview in the central player and trim if needed.
- Compare alternates from the bottom previous-generations panel.
- Select the approved take and continue to timeline/assembly workflows.
Usage Tips
- Name outputs by scene + intent + version so downstream review is faster.
- Keep each modality focused on one task type to avoid prompt confusion.
- Use the bottom generations panel as your source-of-truth shortlist before exporting or placing in edits.