Image to Image
Authentication
Bearer authentication of the form Bearer <token>, where token is your auth token.
Bearer authentication of the form Bearer <token>, where token is your auth token.
Executes a visual similarity search that takes an input image (provided via coactive_image_id or public_url) and searches within a specified dataset (dataset_id) to find visually similar assets. The input image is encoded into an embedding using the dataset’s configured encoder (e.g., CLIP, SigLIP, or other vision-language embedding models), then the embedding is searched against images and video frames in the dataset using vector similarity. Results can be optionally filtered by asset_type (image or video). Returns a ranked list of image and video frame assets ordered by similarity score, including asset metadata, URLs, and optional video information for frame results.