🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
-
Updated
Aug 16, 2024 - Python
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
SoftVC VITS Singing Voice Conversion
Easily train a good VC model with voice data <= 10 mins!
so-vits-svc fork with realtime support, improved interface and more features.
End-to-End Speech Processing Toolkit
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
A simple, high-quality voice conversion tool focused on ease of use and performance.
This is now the official location of the Merlin project.
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Just a fork of RVC for easy audio file voice conversion locally
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
The code for the bark-voicecloning model. Training and inference.
可本地部署的AI语音工具箱 | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.
Unsupervised Speech Decomposition Via Triple Information Bottleneck
singing voice change based on whisper, and lora for singing voice clone
Voice Conversion Tool Kit
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Add a description, image, and links to the voice-conversion topic page so that developers can more easily learn about it.
To associate your repository with the voice-conversion topic, visit your repo's landing page and select "manage topics."