siglip

Here are 21 public repositories matching this topic...

gokayfem / ComfyUI_VLM_nodes

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

image-captioning nodes vlm custom-nodes img2text llm mllm llava comfyui siglip phi15 joytag img2sfx

Updated Feb 13, 2025
Python

merveenoyan / siglip

Star

Projects based on SigLIP (Zhai et. al, 2023) and Hugging Face transformers integration 🤗

machine-learning computer-vision multimodal-learning siglip

Updated Feb 21, 2025
Jupyter Notebook

MCG-NJU / AWT

Star

[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation

computer-vision transfer-learning clip video-understanding zero-shot-learning open-set-recognition vlms siglip

Updated Oct 5, 2024
Python

rizavelioglu / tryoffdiff

Star

Official repository of "TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models".

fashion pytorch e-commerce demo-app image-to-image diffusion virtual-try-on stable-diffusion huggingface-diffusers siglip virtual-try-off

Updated Jan 20, 2025
Python

OrvilleX / MachineLearning

Star

本项目以应用为主出发，结合了从基础的机器学习、深度学习到目标检测以及目前最新的大模型，采用目前成熟的第三方库、开源预训练模型以及相关论文的最新技术，目的是记录学习的过程同时也进行分享以供更多人可以直接进行使用。

machine-learning tensorflow numpy svm sklearn scipy knn spark-mllib llm mllm siglip

Updated Feb 17, 2025
Jupyter Notebook

NikosEfth / freedom

Star

Official PyTorch implementation of the WACV 2025 Oral paper "Composed Image Retrieval for Training-FREE DOMain Conversion".

computer-vision deep-learning neural-networks cross-domain clip image-retrieval cross-domain-learning composed-image-retrieval training-free siglip domain-conversion

Updated Jan 24, 2025
Python

rhysdg / vision-at-a-clip

Sponsor

Star

Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts

machine-learning clip tensorrt onnx zero-shot-classification zero-shot-object-detection foundation-models grounding-dino siglip

Updated Aug 31, 2024
Jupyter Notebook

filipbasara0 / simple-clip

Star

A minimal, but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch

machine-learning deep-learning pytorch representation-learning self-supervised-learning multi-modal-learning contrastive-learning zero-shot-classification siglip

Updated Feb 14, 2024
Jupyter Notebook

miccunifi / Cross-the-Gap

Star

[ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion

Updated Feb 7, 2025

awsaf49 / flickr-dataset

Star

Download flickr8k, flickr30k image caption datasets

image flickr dataset clip captioning-images image-text flickr8k flickr30k siglip

Updated Feb 6, 2024

ola-krutrim / Chitrarth

Star

Chitrarth: Bridging Vision and Language for a Billion People

image transformers vlm siglip

Updated Feb 12, 2025
Python

seanvelasco / memegraph

Sponsor

Star

Meme search and discovery engine using OpenAI CLIP and Salesforce BLIP

memes transformer openai search-algorithm clip mlx siglip catlip corenet

Updated Nov 6, 2024
Python

alejandroolivo / ObjectClassification-with-fastSAM-and-embeddings

Star

Este proyecto presenta una solución de Computer Vision para la detección y clasificación de objetos en imágenes, las cuales son extraídas como frames de vídeos. Utiliza el modelo FastSAM para la detección de objetos, y para la clasificación, emplea embeddings que pueden ser generados mediante dos modelos distintos: CLIP o SigLIP.

python computer-vision clip fastsam siglip