Supertonic (TTS)
Lightning-Fast, On-Device TTS
Generate videos from images and text prompts
Streaming conversational audio in realtime
Generate documents from structured data
Analyze videos to track and label objects
Demo of the Qwen Image Editing Fusion Collection
Fast, multi-speaker TTS (44.1kHz) with voice cloning
Complex text label detection using SAM3 with VLM-FO1
Fast 4-step inference with Qwen Image Edit 2509
The secrets to building world-class LLMs
Generate a video from an image with a prompt
Demo of the Collection of Qwen Image Editing LoRAs
Generate depth maps from images using GPU acceleration
Wan2.2 Animate
Generate any application by Vibe Coding
Convert photos to anime-style images
Qwen-Image-2509-MultipleAngles
Generate a video from an image with a text prompt
AI-powered image editing tool
Qwen-Image-2509-CharacterSheet
Swap faces in images
Generate images from text prompts
Free Text-To-Speech generator with Emotion control (OpenAI)
Image-to-3D Generation
Generate images from text descriptions
Transcribe audio or video into text in multiple languages
Upload audio or YouTube link to get detailed analysis
Generate a video by interpolating between two images with a prompt
Compact model, powerful multimodal reasoning