Xijie Huang HuangOwen

🏠

Working from home

Ph.D. Student in HKUST CSE

95 followers · 75 following

VSDL Lab, HKUST
Santa Monica, CA
https://huangowen.github.io/
@Owen_Huangxj

Lists (3)

🔮 Future ideas

✨ Inspiration

🚀 My stack

Starred repositories

microsoft / VPTQ

VPTQ, A Flexible and Extreme low-bit quantization algorithm

Python · 121 stars · 5 forks · Updated Oct 4, 2024

TUDB-Labs / MixLoRA

State-of-the-art Parameter-Efficient MoE Fine-tuning Method

Python · 68 stars · 8 forks · Updated Aug 22, 2024

Hsu1023 / DuQuant

[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.

Python · 57 stars · 3 forks · Updated Oct 3, 2024

baaivision / Emu3

Next-Token Prediction is All You Need

Python · 799 stars · 22 forks · Updated Sep 30, 2024

HandH1998 / QQQ

QQQ is an innovative and hardware-optimized W4A8 quantization solution for LLMs.

Python · 63 stars · 5 forks · Updated Sep 12, 2024

CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook · 11,594 stars · 1,509 forks · Updated Feb 29, 2024

Dao-AILab / fast-hadamard-transform

Fast Hadamard transform in CUDA, with a PyTorch interface

C · 94 stars · 14 forks · Updated May 24, 2024

HuangOwen / RoLoRA

[EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization

Python · 19 stars · 2 forks · Updated Sep 24, 2024

DD-DuDa / BitDistiller

[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

Python · 75 stars · 8 forks · Updated May 16, 2024

AIoT-MLSys-Lab / SVD-LLM

Official Code for "SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression"

Python · 87 stars · 7 forks · Updated Oct 1, 2024

Qualcomm-AI-research / lr-qat

Python · 13 stars · Updated Sep 4, 2024

NVlabs / Minitron

A family of compressed models obtained via pruning and knowledge distillation

259 stars · 16 forks · Updated Oct 2, 2024

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

4,006 stars · 218 forks · Updated Oct 4, 2024

luosiallen / latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python · 4,304 stars · 224 forks · Updated Jun 14, 2024

Yutong-Zhou-cv / Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,102 stars · 187 forks · Updated Aug 20, 2024

LeapLabTHU / Attention-Mediators

[ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Python · 28 stars · Updated Sep 11, 2024

ollama / ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go · 91,979 stars · 7,241 forks · Updated Oct 3, 2024

openai / consistency_models

Official repo for consistency models.

Python · 6,079 stars · 411 forks · Updated Mar 22, 2024

TencentARC / Open-MAGVIT2

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Python · 631 stars · 24 forks · Updated Sep 27, 2024

zhentingqi / rStar

Python · 336 stars · 34 forks · Updated Sep 23, 2024

locuslab / ect

Consistency Models Made Easy

Python · 197 stars · 7 forks · Updated Sep 23, 2024

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python · 52,322 stars · 5,515 forks · Updated Oct 4, 2024

NexaAI / Awesome-LLMs-on-device

Awesome LLMs on Device: A Comprehensive Survey

777 stars · 108 forks · Updated Sep 24, 2024

buoyancy99 / diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python · 520 stars · 20 forks · Updated Sep 26, 2024

Huage001 / LinFusion

Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"

Python · 212 stars · 14 forks · Updated Oct 2, 2024

Kwai-Kolors / Kolors

Kolors Team

Python · 3,663 stars · 242 forks · Updated Sep 4, 2024

IndexFziQ / Diffusion4NLP-Papers

A paper list about diffusion models for natural language processing.

172 stars · 6 forks · Updated Aug 28, 2023

IST-DASLab / marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python · 575 stars · 45 forks · Updated Sep 4, 2024

IST-DASLab / Sparse-Marlin

Boosting 4-bit inference kernels with 2:4 Sparsity

Cuda · 47 stars · 2 forks · Updated Sep 4, 2024

OpenLMLab / LEval

[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Python · 349 stars · 14 forks · Updated Jul 9, 2024