Creating Multiple Talking Actors

The Talking Actor tool creates two-person dialogue videos from a reference image or video, using Text-to-Speech with voice cloning. The Dialogue list defines the order, lines, and pauses to structure the interaction.

Required Elements

  • Image / Video: Reference image or video of the actor. Supported formats: PNG, JPG, JPEG, MP4, WMV, MOV, and AVI.
  • Voice: Load an audio file for the actor’s voice, or use a preset or reference audio for TTS voice generation. Supported formats: MP3 and WAV.

AI Settings

Select the Talking Actor Tool from the left AI Toolbar and complete the tasks in each section under the Multiple Talking Actor tab. You can collapse or expand a section by clicking its caption.

Ensure both an image / video and dialogue audio are provided to enable the GENERATE button. Note the task price displayed above the button before clicking.

Track progress on the AI Render View or via the new entry in History on the right side of the AI Workspace. When the submission completes successfully, the AI-generated video will appear. Click Play to play the video, Loop to repeat it, or drag the playhead to a specific point.

If satisfied with the generated result, you can upscale the output to a higher resolution.