===================
Sound Prompter
===================

Sound Prompter lets you generate sound effects (optionally with music) that match your composition. It generates .wav files in roughly real time (e.g. ~10 s of generation time for a 10 s clip on a recent NVIDIA GPU). Click to play the examples:

.. video:: https://media-blaceplugins.b-cdn.net/soundprompter_fruit.mp4
    :autoplay:
    :width: 100%
    :loop:

Prompt: "person cutting a fruit"

.. video:: https://media-blaceplugins.b-cdn.net/soundprompter_spaceship.mp4
    :autoplay:
    :width: 100%
    :loop:

Prompt: "spaceship and laser beam"

Requirements
=======================
 - Windows only at present
 - NVIDIA GPU with at least 10 to 12 GB of VRAM
 - ~10 GB of free RAM when invoking the plugin

`Windows Download <https://download.blaceplugins.com/download/depends/SoundPrompter-v1.1.46-Win-55e8720e.zip>`_
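
If you are unsure whether your GPU meets the VRAM requirement, the following is a minimal sketch of a check you can run outside the host application. It is not part of Sound Prompter. It assumes that Python and the ``nvidia-smi`` tool (shipped with the NVIDIA driver) are available on your system, and the 10 GB threshold is simply the lower bound from the list above. The helper names are hypothetical.

.. code-block:: python

    import shutil
    import subprocess

    # Assumption: lower bound taken from the requirements list (10 GB, in MiB).
    MIN_VRAM_MIB = 10 * 1024


    def parse_vram_mib(nvidia_smi_output):
        """Parse one total-memory value (MiB) per GPU, skipping non-numeric lines."""
        values = []
        for line in nvidia_smi_output.splitlines():
            text = line.strip()
            if text.isdigit():
                values.append(int(text))
        return values


    def query_vram_mib():
        """Return total VRAM per GPU in MiB, or an empty list if the query fails."""
        if shutil.which("nvidia-smi") is None:
            return []
        result = subprocess.run(
            ["nvidia-smi", "--query-gpu=memory.total",
             "--format=csv,noheader,nounits"],
            capture_output=True, text=True, check=False,
        )
        if result.returncode != 0:
            return []
        return parse_vram_mib(result.stdout)


    def meets_requirement(vram_mib, minimum=MIN_VRAM_MIB):
        """True if at least one GPU reports the minimum amount of VRAM."""
        return any(value >= minimum for value in vram_mib)


    if __name__ == "__main__":
        gpus = query_vram_mib()
        if not gpus:
            print("No NVIDIA GPU found (or nvidia-smi is not available).")
        elif meets_requirement(gpus):
            print(f"OK: detected VRAM per GPU (MiB): {gpus}")
        else:
            print(f"Below the stated minimum: detected VRAM per GPU (MiB): {gpus}")

The script only reports total VRAM. Memory already used by other applications is not taken into account.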

Workflow
=======================
 - Apply the effect to the layer you want to create the SFX for. This can be a subcomp or an adjustment layer, which lets multiple layers participate in the SFX creation.
 - Click "Add Text Input" to add a mask, then link it in the "Positive Prompt" field. Renaming the mask changes the prompt. Add another Text Input and link it in the "Negative Prompt" field to suppress certain keywords (e.g. music).

 .. image:: img/sp_ui.png


 - Click "Generate Audio". The first invocation can take up to a minute to initialize, because the AI model needs to be loaded and the video frames have to be rendered. After that, you can iterate with different prompts and settings much faster.
 - Once the computation is done, the generated audio file (.wav) is added to your project and placed in the timeline above the effect layer. In the unregistered plugin version, the file contains some gaps and beep sounds.

 .. image:: img/sp_watermark.png


Parameters
===========

Text Prompt
-------------------
Textual description of the SFX.

Negative Text Prompt
--------------------
Suppresses these keywords during generation. For example, include the term "music" to avoid background music. If the generated audio sounds harsh or shrill, add "shrill".

Seed
------
Modify the Seed value to get different variations.

Prompt Influence
------------------
The higher this value, the more strongly the text and video guidance influence the generated sound.

Steps
------
The number of iterations the model runs. Lower numbers run faster but might produce worse results.

Model
------
Choose between the XL and XXL models. The XXL model must be installed manually (from the component manager) and has higher runtime requirements.

Backend & Performance
-----------------------
.. include:: common-args.rst