Audio Node

Learn the basics of audio nodes

The audio node brings voice, sound, and sonic atmosphere into your FLORA canvas. It closes the loop between what you see and what you hear, letting you generate voiceovers, sound effects, and transcriptions alongside the image and video work already happening on your canvas. No jumping out to a separate tool, no booking scratch VO, no silent concepts. Audio nodes turn sound into another first-class node you can pipe into the rest of your workflow.

Here is a quick introduction on how to get started with audio nodes:

Capabilities

  • Text-to-speech. Type a script, choose a voice, and get a voiceover back. Play it on the canvas, export as MP3 or WAV.

  • Text-to-SFX. Describe a sound effect in plain language and generate it in place.

  • Text-to-Music. Describe a sonic environment in plain language and recieve a matching track.

  • Audio-to-text. Transcribe an audio clip into a text node for editing, captioning, or downstream prompting.

  • Lipsync. Pair an audio node with a video node to drive a lipsynced performance.

  • FAUNA-aware. FAUNA can create and chain audio nodes for you as part of a multi-step workflow.

Models

Visit our Audio Models section to learn about the video models and capabilities available in the Video Node.

Last updated

Was this helpful?