nbasyl

Follow

🌵

I am Groot

LIU, Shih-Yang nbasyl

🌵

I am Groot

Follow

HKUST CSE PhD student, Intern at @NVlabs

51 followers · 23 following

HKUST CSE, NVIDIA Research
HK and TW
https://nbasyl.github.io

Achievements

Achievements

Highlights

Pro

Stars

facebookresearch / SpinQuant

Code repo for the paper "SpinQuant LLM quantization with learned rotations"

Python 74 9 Updated Aug 12, 2024

IST-DASLab / marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 532 39 Updated Aug 15, 2024

AIoT-MLSys-Lab / SVD-LLM

Official Code for "SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression"

Python 77 7 Updated May 31, 2024

microsoft / TransformerCompression

For releasing code related to compression methods for transformers, accompanying our publications

Python 352 31 Updated Aug 26, 2024

pytorch / ao

PyTorch native quantization and sparsity for training and inference

Python 571 77 Updated Aug 28, 2024

mit-han-lab / qserve

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Python 382 16 Updated Aug 13, 2024

NVlabs / RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 563 22 Updated Aug 13, 2024

PingchengDong / GQA-LUT

The official implementation of the DAC 2024 paper GQA-LUT

Python 10 Updated Jun 18, 2024

hiyouga / LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 29,703 3,651 Updated Aug 27, 2024

AnswerDotAI / fsdp_qlora

Training LLMs with QLoRA + FSDP

Jupyter Notebook 1,370 182 Updated Aug 28, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 25,785 2,873 Updated Aug 12, 2024

NVlabs / DoRA

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 532 28 Updated Jul 6, 2024

spcl / QuaRot

Code for QuaRot, an end-to-end 4-bit inference of large language models.

Python 245 17 Updated Jul 22, 2024

stanfordnlp / pyreft

ReFT: Representation Finetuning for Language Models

Python 1,042 88 Updated Aug 21, 2024

locuslab / wanda

A simple and effective LLM pruning approach.

Python 600 74 Updated Aug 9, 2024

facebookresearch / dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Python 6,181 897 Updated Jul 3, 2024

ttchengab / continuous_3d_words

This is a project page for continuous 3D words

JavaScript 2 Updated Apr 9, 2024

metame-ai / awesome-llm-plaza

awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.

115 8 Updated Aug 20, 2024

nbasyl / DoRA

Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"

119 3 Updated Apr 28, 2024

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 4,310 422 Updated Aug 28, 2024

lm-sys / vicuna-blog-eval

The code and data for the GPT-4 based benchmark in the vicuna blog post

Python 33 8 Updated Aug 2, 2023

yxli2123 / LoftQ

Python 188 16 Updated Jun 11, 2024

tloen / alpaca-lora

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,512 2,208 Updated Jul 29, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 18,947 2,075 Updated Aug 12, 2024

wy1iu / butterfly-oft

Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"

72 Updated Apr 12, 2024

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,024 883 Updated Aug 27, 2024

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,609 1,502 Updated Aug 26, 2024

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 10,172 649 Updated Aug 14, 2024

nbasyl / LLM-FP4

The official implementation of the EMNLP 2023 paper LLM-FP4

Python 153 9 Updated Dec 15, 2023

sangminwoo / awesome-vision-and-language

A curated list of awesome vision and language resources (still under construction... stay tuned!)

425 36 Updated Aug 4, 2024