Hybrid edge-cloud vision-language model (VLM) system for interactive semantic grounding and robotics perception. Powered by PaliGemma 2 and Gemini 3.0.
robotics computervision vlm robotics-perception gemini-api edge-ai vision-language-model hybrid-ai paligemma2 semantic-grounding
Updated Jan 14, 2026 - Python