Text to Video (with Metadata Filters)
Text to Video (with Metadata Filters)
Text to Video (with Metadata Filters)
Triggers an asynchronous text-to-video search within the specified dataset (dataset_id), restricted to assets matching every entry in metadata_filters (e.g., genre = musical, rating != PG, headline contains olympics, impressions > 1000). The text_query is encoded into an embedding using the dataset’s configured vision-language encoder (e.g., CLIP, SigLIP, Perception Encoder) and ranked against video keyframe embeddings via vector similarity, while the metadata filters are applied as a pre-scoring constraint on the asset metadata fields. Returns the Databricks run_id immediately; use the status and result endpoints to poll for the final ranked list of videos (each with the parent video, top composite slice, top keyframe, metadata, score, and optional moderation score).
Bearer authentication of the form Bearer <token>, where token is your auth token.
When true, moderation scoring is skipped and moderation_score will be null on all results.