#

vision-models

Here are 16 public repositories matching this topic...

antonio-f / Moondream

Testing the Moondream tiny vision model

tutorial artificial-intelligence image-captioning language-models image-descriptions hands-on huggingface-transformers vision-models vision-transformers running-locally tiny-models

Updated May 12, 2024
Jupyter Notebook

duynamrcv / vision_flocking

Vision-based swarms in the Presence of Occlusions

python3 swarm-robotics behavior-control vision-models

Updated Jun 28, 2024
Python

regokan / deep-vision-lab

A comprehensive repository for research, code, and insights on convolutional neural networks and deep vision models

cnn vision-models

Updated Nov 5, 2024
Jupyter Notebook

afondiel / how-diffusion-models-work-crash-course-DLAI

Diffusion Models crash course with Pytorch from DeepLearningAI

computer-vision latent-space diffusion-models conditional-generation vision-models latent-diffusion generative-ai unconditional-generation genai conditional-diffusion

Updated Oct 14, 2024
Jupyter Notebook

shivendrra / AIVA-4x500m

building AVA from ex-machina; a lightweight multi-modal system from scratch, just for learning & experimentation

machine-learning transformer vision audio-engine audio-classification vision-engine vision-models vision-transformer swin-transformer large-language-models llm audio-transformers

Updated Jul 31, 2024
Jupyter Notebook

Amr-Abdellatif / Fine-Tuninng-Pre-Trained-Vision-models-PyTorch

In This repo i FineTuned a Pretrained ResNet18 model from PyTorch library

pytorch vision pretrained-models fine-tuning vision-models

Updated Feb 17, 2024
Jupyter Notebook

afondiel / Prompt-Engineering-for-Vision-Models-DeepLearningAI

These notes and resources are compiled from the crash course Prompt Engineering for Vision Models offered by DeepLearning.AI.

image-processing cnn video-processing vit diffusion-models convnets vision-models visual-prompting prompt-engineering vision-language-model large-vision-language-models meta-sam large-vision-models vision-model-prompting

Updated Aug 20, 2024
Jupyter Notebook

ArashAkbarinia / DeepTHS

A framework to compute threshold sensitivity of deep networks to visual stimuli.

deep-neural-networks deep-learning sensitivity-analysis cognitive-neuroscience linear-probing linear-classifier explainable-ai vision-models human-machine-behavior

Updated Jul 4, 2024
Python

Pavansomisetty21 / Image-Caption-Generation-using-LLMs-GEMINI-

we generate captions to the images which are given by user(user input) using prompt engineering and Generative AI

Updated Aug 24, 2024
Jupyter Notebook

major196512 / vistem

General Vision Model Training Template

pytorch vision-models

Updated Nov 12, 2020
Python

ksm26 / Prompt-Engineering-for-Vision-Models

Enhance your skills in prompt engineering for vision models. Learn to effectively prompt, fine-tune, and track experiments for models like SAM, OWL-ViT, and Stable Diffusion 2.0 to achieve precise image generation, segmentation, and object detection.

machine-learning sam image-generation object-detection image-segmentation hyperparameter-tuning comet-library fine-tuning diffusion-models vision-models in-painting prompt-engineering stable-diffusion dreambooth owl-vit visual-workflows

Updated May 13, 2024
Jupyter Notebook

The-Swarm-Corporation / swarm-models

A simple to use package to call various model providers such as openai, anthropic, and others with utmost reliability, security, and performance.

library ai computer-vision tool ml usage production-ready swarms agents enterprise-grade vision-models llms

Updated Nov 5, 2024
Python

EthanBnntt / tinygrad-gmlp

An implementation of gated MLPs in tinygrad, as an alternative to transformers.

machinelearning vision-models tinygrad gmlp

Updated Sep 6, 2024
Python

kyegomez / Midas

Implementation of Midas from [Towards Robust Monocular Depth Estimation] in Pytorch and Zeta

python ai tensorflow parallel ml pytorch artificial-intelligence multi-modal vision-models

Updated Mar 11, 2024
Shell

kyegomez / VisionLLaMA

Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta

ai deep-learning vit multi-modal vision-models vision-transformers

Updated Nov 4, 2024
Python

computer-vision-challenge

afondiel / computer-vision-challenge

This is a series of computer vision foundational projects that anyone diving into the field must tackle.

computer-vision image-processing cnn image-classification image-generation image-detection lvm vlm computer-vision-algorithms computer-vision-tools computer-vision-opencv computer-vision-datasets vision-models vision-transformer computer-vision-python computer-vision-projects computer-vision-hello-world cv-challenge computer-vision-challenge

Updated Nov 1, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the vision-models topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vision-models topic, visit your repo's landing page and select "manage topics."