[go: up one dir, main page]

Skip to content
View zyt1024's full-sized avatar

Block or report zyt1024

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official inference framework for 1-bit LLMs

C++ 10,788 730 Updated Oct 31, 2024

CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)

Python 8,217 603 Updated Aug 13, 2024

Java-地下城与勇士-dnf工具

Java 83 10 Updated Jun 2, 2024
Shell 1,035 337 Updated Aug 27, 2024
Dockerfile 3 2 Updated Nov 18, 2020

Introductory examples for using PYNQ with Alveo

Jupyter Notebook 48 17 Updated Mar 14, 2023

Neural network inferences on Alveo cards with hls4ml framework

Ada 7 1 Updated Jul 28, 2022

Tutorial for deploying models on Alveo boards

Jupyter Notebook 5 2 Updated Jun 20, 2022

Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller

Jupyter Notebook 4,353 802 Updated Apr 24, 2023

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2,217 255 Updated Nov 5, 2024

A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Train…

Python 1,387 131 Updated Oct 15, 2024

A simple Implementation of ParticleNet in Pytorch Geometric

Python 7 3 Updated Feb 15, 2021

Graph Neural Network Library for PyTorch

Python 21,301 3,649 Updated Oct 31, 2024

基于FPGA的数字识别-实时视频处理的定点卷积神经网络实现

Verilog 273 63 Updated May 2, 2023

使用Verilog实现的CNN模块,可以方便的在FPGA项目中使用

Verilog 491 107 Updated Jun 18, 2018

clash for windows汉化版. 提供clash for windows的汉化版, 汉化补丁及汉化版安装程序

JavaScript 21,147 2,746 Updated Oct 16, 2024

C++那些事

C++ 39,379 8,534 Updated Jun 14, 2024

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…

Cuda 821 131 Updated Jul 29, 2023

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 22,236 5,583 Updated Nov 5, 2024

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

C++ 2,934 333 Updated Jul 31, 2024

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 11,762 3,468 Updated Nov 5, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 55,963 5,744 Updated Aug 24, 2024

llm deploy project based mnn.

C++ 1,462 163 Updated Nov 5, 2024

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 11,064 1,604 Updated Oct 26, 2024

A demo for accelerating YOLOv2 in xilinx's fpga pynq/zedboard

C 777 233 Updated Jul 29, 2024

A FPGA-based neural network inference accelerator, which won the third place in DAC-SDC

C 28 3 Updated May 11, 2022

Designs for finalist teams of the DAC System Design Contest

Objective-C 35 19 Updated Jul 8, 2020

PyTorch implementation of DiracDeltaNet from paper Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs

Python 31 10 Updated May 30, 2019
Next