Chung-I
  • Department of Electrical Engineering, National Taiwan University
  • Taipei, Taiwan

Functionally Reduced And-Inverter Graph

C++ 3 Updated Jul 8, 2020

Python scripts to bulk upload your local image as emojis to your Slack

Python 3 Updated Aug 27, 2024

[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations

HTML 132 4 Updated Apr 27, 2024

Unsupervised Speech Decomposition Via Triple Information Bottleneck

Python 637 92 Updated Feb 8, 2022

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 372 21 Updated Sep 11, 2024

Instant voice cloning by MIT and MyShell.

Python 28,984 2,826 Updated Aug 21, 2024

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 153 9 Updated Apr 20, 2024

Pikachu Volleyball implemented into JavaScript by reverse engineering the original game

JavaScript 973 113 Updated Apr 2, 2024

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

C 117 8 Updated Mar 6, 2024

Unofficial implementation of NVIDIA P-Flow TTS paper

Python 213 30 Updated Jul 1, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,493 385 Updated Sep 23, 2024

Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization

Python 147 10 Updated Jul 12, 2024

DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning

Python 47 4 Updated Jan 18, 2024

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python 414 48 Updated Apr 24, 2024
Python 3 Updated Sep 14, 2023

multilingual speech aligner

Python 71 5 Updated Nov 19, 2023

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 3,816 207 Updated Jun 18, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,696 2,108 Updated Jul 18, 2024

Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering"

Python 41 5 Updated May 19, 2023

phoneme tokenizer and grapheme-to-phoneme model for 8k languages

Python 142 13 Updated Jun 9, 2023
Python 252 35 Updated May 15, 2023

fine-tune Whisper model for Taiwanese speech recognition

Python 26 8 Updated Mar 23, 2023

"Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences", ICASSP 2023

Python 3 1 Updated Mar 18, 2023

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,265 137 Updated Jun 6, 2024

Grapheme to phoneme conversion with deep learning.

Python 352 38 Updated Dec 8, 2023

A tokenizer, text cleaner, and phonemizer for many human languages.

Python 276 36 Updated Jul 3, 2024

A playbook for systematically maximizing the performance of deep learning models.

26,629 2,211 Updated Jun 18, 2024

Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.

Python 26 6 Updated Jul 25, 2024
Python 3 Updated Nov 22, 2022

CKIP CoreNLP Toolkits

Python 114 15 Updated Apr 9, 2023