Skip to content

Search for specific visual content within a video by describing it in words

License

Notifications You must be signed in to change notification settings

shashnkvats/CLIPMatch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CLIPMatch

CLIPMatch is a gradio-based application that facilitates the exploration of specific visual content within a video by describing it in words. It employs the capabilities of OpenAI's CLIP model to analyze the similarity between video content and textual descriptions. Users can upload a video file and enter a text query, which then generates a similarity graph over time, making it possible to pinpoint the segments where the described content is most likely to appear.

Watch the video

Features

  • Video Upload: Upload a video file to be analyzed.
  • Text Query: Enter a text description to search for specific visual content within the video.
  • Similarity Graph: Generate a similarity graph showing the correlation between the video and text over time, identifying the instances where the described content is visually represented.
  • Closest Match Identification: The graph highlights the point of highest similarity between the video and text, aiding in locating the described visual content.

Installation

Ensure you have the necessary libraries installed using the following:

pip install -r requirements.txt

Usage

  • Clone the repository to your local machine.
  • Navigate to the project directory in the terminal.
  • Run the following command to launch the Gradio interface:

python app.py

  • Open the Gradio interface in your web browser (the URL will be displayed in the terminal).
  • Upload a video file and enter a text query to search for specific visual content within the video.
  • View the similarity graph and analyze the results to find the closest match for your query.

Dependencies

      Gradio
      Matplotlib
      NumPy
      OpenCV
      Torch
      Torchvision
      Clip
      Matplotlib
      PIL (Pillow)
      tqdm

About

Search for specific visual content within a video by describing it in words

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages