[go: up one dir, main page]

Skip to content
View v-iashin's full-sized avatar
👨‍💻
👨‍💻

Block or report v-iashin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Synchformer Synchformer Public

    Efficient synchronization from sparse cues

    Python 28 4

  2. SparseSync SparseSync Public

    Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)

    Python 50 9

  3. SpecVQGAN SpecVQGAN Public

    Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

    Jupyter Notebook 347 40

  4. video_features video_features Public

    Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.

    Python 525 97

  5. BMT BMT Public

    Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)

    Jupyter Notebook 225 57

  6. MDVC MDVC Public

    PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)

    Python 143 20