[go: up one dir, main page]

Skip to content
View Jacen789's full-sized avatar
🎯
专注
🎯
专注

Block or report Jacen789

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A cross-platform launcher that simply works

Go 24,285 2,361 Updated Aug 20, 2024

A rule-based tunnel for Android.

Kotlin 12,785 1,064 Updated Aug 27, 2024

Brand new TTS solution

Python 7,252 574 Updated Aug 25, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 4,178 402 Updated Aug 21, 2024

Region proposal network based small-footprint keyword spotting (Pytorch)

Python 51 16 Updated Nov 15, 2023

A generative speech model for daily dialogue.

Python 29,692 3,238 Updated Aug 25, 2024
Python 434 38 Updated Jun 7, 2024
Python 35 9 Updated Apr 24, 2024

Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)

Python 35 6 Updated Jun 3, 2024

Instant voice cloning by MIT and MyShell.

Python 28,082 2,752 Updated Aug 21, 2024

An enhancer/customiser for the all-in-one productivity workspace Notion

JavaScript 4,750 238 Updated Jul 19, 2024

vits2 backbone with multilingual-bert

Python 7,702 1,094 Updated Aug 26, 2024

leaked prompts of GPTs

28,121 3,772 Updated Aug 26, 2024

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 1,820 147 Updated Jul 22, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 10,632 1,133 Updated Aug 21, 2024

Mamba SSM architecture

Python 12,270 1,033 Updated Aug 15, 2024

A collection of links and notes on forced alignment tools

Python 862 86 Updated Nov 10, 2021

Python interface for forced audio alignment using HTK and SoX

Python 331 77 Updated Jun 28, 2020

Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3

Python 96 20 Updated Feb 27, 2024

Pythonic bindings for FFmpeg's libraries.

Cython 2,431 359 Updated Aug 19, 2024

Faster Whisper transcription with CTranslate2

Python 11,063 927 Updated Aug 21, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 66,614 7,849 Updated Aug 19, 2024

Sequence algorithms for use in Flashlight.

C++ 12 2 Updated May 6, 2024

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

Python 127 10 Updated Oct 3, 2023

Text Normalization & Inverse Text Normalization

Python 441 66 Updated Aug 1, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 24,364 3,189 Updated Jul 23, 2024

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

Python 4,063 412 Updated Aug 23, 2024

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Python 7,522 747 Updated Feb 11, 2024

Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调

Python 4,452 444 Updated Oct 30, 2023

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,599 918 Updated Apr 23, 2024
Next