Vector Databases for Semantic Search in Video Archives

Vector Databases

Finding a specific shot in petabytes of video footage used to take hours of manual scrubbing. By using vector databases and vision-language models, we made our entire archive searchable by natural language.

Table of contents:

Generating Embeddings

We sample frames from our videos every second and process them through OpenAI's CLIP model, creating mathematical representations (vectors) of the visual content.

Milvus and Pinecone

Storing millions of highly-dimensional vectors requires specialized tech. We evaluate the performance of vector databases using HNSW algorithms to perform millisecond nearest-neighbor searches.

Multi-modal Search

By aligning text and image embeddings, a user can search for 'a dog running on the beach at sunset' and the system retrieves the vector that mathematically closest matches that text prompt.

Contact

Let's talk.

A direct line to the team behind the work. No account managers, no briefing relay between departments. Tell us about your next project and we'll reply within 24 hours with concrete next steps.

Response Within 24 hours, direct from the team

Available  •  Remote-first, worldwide

Briefing

Send us a short briefing.