For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Get Started
Docs & GuidesAPI Reference
Docs & GuidesAPI Reference
  • Authentication
    • API Authentication
  • API Reference
      • POSTAudio Sound (with Metadata Filters)
      • GETJob Result
      • GETJob Status
      • POSTText to Image (with Metadata Filters)
      • POSTText to Keyframe (with Metadata Filters)
      • POSTText to Shot (with Metadata Filters)
      • POSTText to Transcript (with Metadata Filters)
      • POSTText to Video (with Metadata Filters)
Get Started
LogoLogo
API ReferenceSearch Metadata Filter

Text to Video (with Metadata Filters)

POST
https://api.coactive.ai/api/v0/search/metadata-filters/text-to-video
POST
/api/v0/search/metadata-filters/text-to-video
$curl -X POST https://api.coactive.ai/api/v0/search/metadata-filters/text-to-video \
> -H "Authorization: Bearer <token>" \
> -H "Content-Type: application/json" \
> -d '{
> "dataset_id": "11973632-462e-4e14-a744-181b28ea931d",
> "text_query": "live concert performance"
>}'
1{
2 "run_id": 452378
3}
Triggers an asynchronous text-to-video search within the specified dataset (`dataset_id`), restricted to assets matching every entry in `metadata_filters` (e.g., `genre = musical`, `rating != PG`, `headline contains olympics`, `impressions > 1000`). The `text_query` is encoded into an embedding using the dataset's configured vision-language encoder (e.g., CLIP, SigLIP, Perception Encoder) and ranked against video keyframe embeddings via vector similarity, while the metadata filters are applied as a pre-scoring constraint on the asset metadata fields. Returns the Databricks `run_id` immediately; use the status and result endpoints to poll for the final ranked list of videos (each with the parent video, top composite slice, top keyframe, metadata, score, and optional moderation score).
Was this page helpful?
Previous

Get Detailed Video Search By Id

Next
Built with

Triggers an asynchronous text-to-video search within the specified dataset (dataset_id), restricted to assets matching every entry in metadata_filters (e.g., genre = musical, rating != PG, headline contains olympics, impressions > 1000). The text_query is encoded into an embedding using the dataset’s configured vision-language encoder (e.g., CLIP, SigLIP, Perception Encoder) and ranked against video keyframe embeddings via vector similarity, while the metadata filters are applied as a pre-scoring constraint on the asset metadata fields. Returns the Databricks run_id immediately; use the status and result endpoints to poll for the final ranked list of videos (each with the parent video, top composite slice, top keyframe, metadata, score, and optional moderation score).

Authentication

AuthorizationBearer

Bearer authentication of the form Bearer <token>, where token is your auth token.

Request

This endpoint expects an object.
dataset_idstringRequiredformat: "uuid"
Dataset to search within
text_querystringRequired>=2 characters
Natural language query encoded into an embedding to score visual content via vector similarity.
limitintegerOptional1-100Defaults to 40
Max number of results to return
offsetintegerOptional>=0Defaults to 0
Number of results to skip before returning
metadata_filterslist of objectsOptional
List of filter objects applied against asset metadata fields before scoring. All filters are combined with AND.
skip_moderationbooleanOptionalDefaults to false

When true, moderation scoring is skipped and moderation_score will be null on all results.

Response

Successful Response
run_idinteger
Databricks job run ID to use for status polling and result retrieval

Errors

422
Unprocessable Entity Error