🚩 Revolutionizing.
  • Beet Labs
  • Rio de Janeiro

saulocatharino/README.md

Saulo Catharino

Blog

Pinned

  1. Video-LLaMA (Public)

    Forked from DAMO-NLP-SG/Video-LLaMA

    Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

    Python

  2. VisionLLM (Public)

    Forked from OpenGVLab/VisionLLM

    VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks

  3. Voice-Identification (Public)

    Forked from AKBoles/Voice-Identification

    Project to explore Speaker and Voice Identification. To follow will be further Speech Recognition tasks.

    Jupyter Notebook 1

  4. whisper (Public)

    Forked from openai/whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

    Python

  5. YOLOX (Public)

    Forked from Megvii-BaseDetection/YOLOX

    YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

    Python