[go: up one dir, main page]

Skip to content
View wavy-jung's full-sized avatar
🤗
🤗

Block or report wavy-jung

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A collection of AWESOME things about mixture-of-experts

903 68 Updated Jul 31, 2024

Reaching LLaMA2 Performance with 0.1M Dollars

Python 954 77 Updated Jul 23, 2024

Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method ; GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model

Python 31 1 Updated Aug 4, 2023

Repo-Level Code generation papers

58 3 Updated Jun 16, 2024

An incremental parsing system for programming tools

Rust 17,828 1,336 Updated Aug 26, 2024

API Documentation Browser

Ruby 34,846 2,319 Updated Aug 23, 2024

An Extensible Deep Learning Library

Python 1,741 225 Updated Aug 26, 2024

Mixture-of-Experts (MoE) Language Model

Python 175 38 Updated Aug 22, 2024

Notebooks for the O'Reilly book "Learning Ray"

Jupyter Notebook 240 63 Updated Apr 25, 2024

Agentic components of the Llama Stack APIs

Python 3,071 291 Updated Aug 26, 2024

A repository for research on medium sized language models.

Python 463 62 Updated Aug 20, 2024

Faker is a Python package that generates fake data for you.

Python 17,515 1,910 Updated Aug 23, 2024

LLM101n: Let's build a Storyteller

27,303 1,493 Updated Aug 1, 2024

Scalable toolkit for efficient model alignment

Python 492 51 Updated Aug 27, 2024

DataComp: In search of the next generation of multimodal datasets

Python 629 49 Updated Jan 2, 2024

DataComp for Language Models

HTML 1,079 95 Updated Aug 19, 2024

Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.

Python 440 133 Updated Aug 26, 2024

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

119 5 Updated Jun 12, 2024

Easily embed, cluster and semantically label text datasets

Python 420 32 Updated Mar 28, 2024

A simple, performant and scalable Jax LLM!

Python 1,418 260 Updated Aug 27, 2024

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 14,791 980 Updated Aug 27, 2024

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 613 35 Updated Aug 22, 2024

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,495 234 Updated May 1, 2024

Official inference library for Mistral models

Jupyter Notebook 9,454 831 Updated Aug 22, 2024

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Python 831 44 Updated Jun 25, 2024

Manage scalable open LLM inference endpoints in Slurm clusters

Python 216 21 Updated Jul 11, 2024
Python 408 39 Updated Jul 17, 2024

A massively parallel, high-level programming language

Rust 17,109 419 Updated Aug 23, 2024

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 11,425 1,618 Updated Aug 25, 2024

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 1,302 87 Updated Aug 27, 2024
Next