wavy-jung

🤗

Doohae Jung wavy-jung

🤗

AI researcher at @kakao

35 followers · 73 following

@kakao
Seoul, Korea

Achievements

Starred repositories

XueFuzhao / awesome-mixture-of-experts

A collection of AWESOME things about mixture-of-experts

903 68 Updated Jul 31, 2024

myshell-ai / JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

Python 954 77 Updated Jul 23, 2024

aitsc / GLMKD

Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method ; GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model

Python 31 1 Updated Aug 4, 2023

allanj / repo-level-codegen-papers

Repo-Level Code generation papers

58 3 Updated Jun 16, 2024

tree-sitter / tree-sitter

An incremental parsing system for programming tools

Rust 17,828 1,336 Updated Aug 26, 2024

freeCodeCamp / devdocs

API Documentation Browser

Ruby 34,846 2,319 Updated Aug 23, 2024

apple / axlearn

An Extensible Deep Learning Library

Python 1,741 225 Updated Aug 26, 2024

IEIT-Yuan / Yuan2.0-M32

Mixture-of-Experts (MoE) Language Model

Python 175 38 Updated Aug 22, 2024

maxpumperla / learning_ray

Notebooks for the O'Reilly book "Learning Ray"

Jupyter Notebook 240 63 Updated Apr 25, 2024

meta-llama / llama-agentic-system

Agentic components of the Llama Stack APIs

Python 3,071 291 Updated Aug 26, 2024

mlfoundations / open_lm

A repository for research on medium sized language models.

Python 463 62 Updated Aug 20, 2024

joke2k / faker

Faker is a Python package that generates fake data for you.

Python 17,515 1,910 Updated Aug 23, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

27,303 1,493 Updated Aug 1, 2024

NVIDIA / NeMo-Aligner

Scalable toolkit for efficient model alignment

Python 492 51 Updated Aug 27, 2024

mlfoundations / datacomp

DataComp: In search of the next generation of multimodal datasets

Python 629 49 Updated Jan 2, 2024

mlfoundations / dclm

DataComp for Language Models

HTML 1,079 95 Updated Aug 19, 2024

NVIDIA / NeMo-Framework-Launcher

Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.

Python 440 133 Updated Aug 26, 2024

SkyworkAI / Skywork-MoE

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

119 5 Updated Jun 12, 2024

huggingface / text-clustering

Easily embed, cluster and semantically label text datasets

Python 420 32 Updated Mar 28, 2024

google / maxtext

A simple, performant and scalable Jax LLM!

Python 1,418 260 Updated Aug 27, 2024

unslothai / unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 14,791 980 Updated Aug 27, 2024

princeton-nlp / SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 613 35 Updated Aug 22, 2024

databricks / dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,495 234 Updated May 1, 2024

mistralai / mistral-inference

Official inference library for Mistral models

Jupyter Notebook 9,454 831 Updated Aug 22, 2024

pjlab-sys4nlp / llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Python 831 44 Updated Jun 25, 2024

huggingface / llm-swarm

Manage scalable open LLM inference endpoints in Slurm clusters

Python 216 21 Updated Jul 11, 2024

huggingface / cosmopedia

Python 408 39 Updated Jul 17, 2024

HigherOrderCO / Bend

A massively parallel, high-level programming language

Rust 17,109 419 Updated Aug 23, 2024

meta-llama / llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 11,425 1,618 Updated Aug 25, 2024

argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 1,302 87 Updated Aug 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Doohae Jung wavy-jung

Achievements

Achievements

Block or report wavy-jung

Starred repositories

XueFuzhao / awesome-mixture-of-experts

myshell-ai / JetMoE

aitsc / GLMKD

allanj / repo-level-codegen-papers

tree-sitter / tree-sitter

freeCodeCamp / devdocs

apple / axlearn

IEIT-Yuan / Yuan2.0-M32

maxpumperla / learning_ray

meta-llama / llama-agentic-system

mlfoundations / open_lm

joke2k / faker

karpathy / LLM101n

NVIDIA / NeMo-Aligner

mlfoundations / datacomp

mlfoundations / dclm

NVIDIA / NeMo-Framework-Launcher

SkyworkAI / Skywork-MoE

huggingface / text-clustering

google / maxtext

unslothai / unsloth

princeton-nlp / SimPO

databricks / dbrx

mistralai / mistral-inference

pjlab-sys4nlp / llama-moe

huggingface / llm-swarm

huggingface / cosmopedia

HigherOrderCO / Bend

meta-llama / llama-recipes

argilla-io / distilabel

Starred topics

Google