Stars
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Introductory examples for using PYNQ with Alveo
Neural network inferences on Alveo cards with hls4ml framework
Tutorial for deploying models on Alveo boards
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Train…
A simple Implementation of ParticleNet in Pytorch Geometric
Graph Neural Network Library for PyTorch
clash for windows汉化版. 提供clash for windows的汉化版, 汉化补丁及汉化版安装程序
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Open deep learning compiler stack for cpu, gpu and specialized accelerators
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
A demo for accelerating YOLOv2 in xilinx's fpga pynq/zedboard
A FPGA-based neural network inference accelerator, which won the third place in DAC-SDC
Designs for finalist teams of the DAC System Design Contest
PyTorch implementation of DiracDeltaNet from paper Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs