WORKFLOWS/TIKTOK-CAPTIONS

TikTok Captions

Generate professional animated captions powered by OpenAI Whisper with dynamic styles and engagement features.

Video*

Video Path

Caption Size

The maximum number of words to generate in each window

Highlight Color

The color of the highlight for the captioned text

model

Whisper model size (currently only large-v3 is supported).

language

Language spoken in the audio, specify 'auto' for automatic language detection

Temperature

temperature to use for sampling

Patience

optional patience value to use in beam decoding, as in https://arxiv.org/abs/2204.05424, the default (1.0) is equivalent to conventional beam search

Suppress Tokens

comma-separated list of token ids to suppress during sampling; '-1' will suppress most special characters except common punctuations

Initial Prompt

optional text to provide as a prompt for the first window.

Condition On Previous Text

if True, provide the previous output of the model as a prompt for the next window; disabling may make the text inconsistent across windows, but the model becomes less prone to getting stuck in a failure loop

Temperature Increment On Fallback

temperature to increase when falling back when the decoding fails to meet either of the thresholds below

Compression Ratio Threshold

if the gzip compression ratio is higher than this value, treat the decoding as failed

Logprob Threshold

if the average log probability is lower than this value, treat the decoding as failed

No Speech Threshold

if the probability of the <|nospeech|> token is higher than this value AND the decoding has failed due to `logprob_threshold`, consider the segment as silence

1 CREDIT

Generated in 0.087603158636s

TikTok Captions

QUEUE

GENERATION HISTORY

FLOWS