Jacen789

Follow

🎯

专注

Jacen Jacen789

🎯

专注

Follow

★★★★★☆

63 followers · 52 following

Guangzhou, Guangdong, China
https://www.cnblogs.com/jacen789/

Achievements

Achievements

Starred repositories

Wox-launcher / Wox

A cross-platform launcher that simply works

Go 24,285 2,361 Updated Aug 20, 2024

MetaCubeX / ClashMetaForAndroid

Forked from xuhaoyang/ClashForAndroid

A rule-based tunnel for Android.

Kotlin 12,785 1,064 Updated Aug 27, 2024

fishaudio / fish-speech

Brand new TTS solution

Python 7,252 574 Updated Aug 25, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 4,178 402 Updated Aug 21, 2024

jingyonghou / RPN_KWS

Region proposal network based small-footprint keyword spotting (Pytorch)

Python 51 16 Updated Nov 15, 2023

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 29,692 3,238 Updated Aug 25, 2024

Tele-AI / TeleSpeech-ASR

Python 434 38 Updated Jun 7, 2024

gitwukeyi / FSPEN

Python 35 9 Updated Apr 24, 2024

ncsoft / PhonMatchNet

Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)

Python 35 6 Updated Jun 3, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell.

Python 28,082 2,752 Updated Aug 21, 2024

notion-enhancer / notion-enhancer

An enhancer/customiser for the all-in-one productivity workspace Notion

JavaScript 4,750 238 Updated Jul 19, 2024

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 7,702 1,094 Updated Aug 26, 2024

linexjlin / GPTs

leaked prompts of GPTs

28,121 3,772 Updated Aug 26, 2024

linto-ai / whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 1,820 147 Updated Jul 22, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 10,632 1,133 Updated Aug 21, 2024

state-spaces / mamba

Mamba SSM architecture

Python 12,270 1,033 Updated Aug 15, 2024

pettarin / forced-alignment-tools

A collection of links and notes on forced alignment tools

Python 862 86 Updated Nov 10, 2021

prosodylab / Prosodylab-Aligner

Python interface for forced audio alignment using HTK and SoX

Python 331 77 Updated Jun 28, 2020

jaekookang / p2fa_py3

Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3

Python 96 20 Updated Feb 27, 2024

PyAV-Org / PyAV

Pythonic bindings for FFmpeg's libraries.

Cython 2,431 359 Updated Aug 19, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 11,063 927 Updated Aug 21, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 66,614 7,849 Updated Aug 19, 2024

flashlight / sequence

Sequence algorithms for use in Flashlight.

C++ 12 2 Updated May 6, 2024

yl4579 / HiFTNet

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

Python 127 10 Updated Oct 3, 2023

wenet-e2e / WeTextProcessing

Text Normalization & Inverse Text Normalization

Python 441 66 Updated Aug 1, 2024

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 24,364 3,189 Updated Jul 23, 2024

THUDM / VisualGLM-6B

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

Python 4,063 412 Updated Aug 23, 2024

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Python 7,522 747 Updated Feb 11, 2024

SCIR-HI / Huatuo-Llama-Med-Chinese

Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草（原名：华驼）模型仓库，基于中文医学知识的大语言模型指令微调

Python 4,452 444 Updated Oct 30, 2023

PlayVoice / whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,599 918 Updated Apr 23, 2024

Starred topics

diffusion-models

multi-modal

hotword-detection

keyword-spotting

wake-word-detection

tts

voice-conversion

Machine learning

Deep learning

Algorithm