[go: up one dir, main page]

Skip to content
#

msvd

Here are 16 public repositories matching this topic...

This project utilizes advanced deep learning techniques to automatically generate contextually relevant captions for videos by extracting spatial and temporal features, while incorporating Gaussian attention to focus on important regions. This enhances video indexing, retrieval, and accessibility for visually impaired individuals.

  • Updated Jul 24, 2023
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the msvd topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the msvd topic, visit your repo's landing page and select "manage topics."

Learn more