[go: up one dir, main page]

Skip to content
View enp1s0's full-sized avatar
🤯
Computing
🤯
Computing

Organizations

@FDPS @rioyokotalab @mori-lab @rapidsai @wmmae @hpc-wakate

Block or report enp1s0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

collection of articles about PhD life written in 🇯🇵

190 4 Updated Jun 15, 2024

The book "Performance Analysis and Tuning on Modern CPU"

TeX 2,144 157 Updated Nov 5, 2024

LLM training in simple, raw C/CUDA

Cuda 24,319 2,742 Updated Oct 2, 2024

The official Vim repository

Vim Script 36,580 5,458 Updated Nov 4, 2024

A ksvd implementation written in python.

Python 104 22 Updated Dec 26, 2022

Itoyori: A distributed multi-threading runtime system for global-view fork-join task parallelism

C++ 20 2 Updated Feb 9, 2024

display manager with console UI

Zig 5,450 305 Updated Oct 12, 2024

int8_t and int16_t matrix multiply based on https://arxiv.org/abs/1705.01991

C++ 63 21 Updated Dec 30, 2023

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via OpenCL. Free for non-commercial use.

C++ 3,896 310 Updated Oct 31, 2024

Synchronize your working directory efficiently to a remote place without committing the changes.

Go 73 11 Updated Nov 7, 2022

GPTPU for SC 2021

C++ 48 9 Updated Mar 22, 2023

Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloaded from https://developer.nvidia.com/nvcomp.

C++ 560 78 Updated Sep 11, 2024

stdgpu: Efficient STL-like Data Structures on the GPU

C++ 1,155 83 Updated Oct 22, 2024

Linux Kernel for Surface Devices

Shell 5,129 216 Updated Oct 20, 2024

A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).

C++ 517 64 Updated May 22, 2024

Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies

C++ 60,966 10,292 Updated Nov 6, 2024

Templight is a Clang-based tool to profile the time and memory consumption of template instantiations and to perform interactive debugging sessions to gain introspection into the template instantia…

C++ 730 41 Updated Oct 18, 2024

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…

C++ 1,121 508 Updated Aug 21, 2024

Test suite for probing the numerical behavior of NVIDIA tensor cores

Cuda 30 12 Updated Jul 24, 2024

Important concepts in numerical linear algebra and related areas

726 61 Updated Jan 13, 2024

A massively-parallel, block-sparse tensor framework written in C++

C++ 255 52 Updated Nov 6, 2024

Parallel Library for Tensor Network Methods

C++ 28 8 Updated Aug 22, 2023

⚡ Dark powered Vim/Neovim plugin manager

Vim Script 3,427 198 Updated May 13, 2024

gpuprec: Extended-Precision Libraries on GPUs

Cuda 34 7 Updated Jan 9, 2016

Crow is very fast and easy to use C++ micro web framework (inspired by Python Flask)

C++ 7,481 891 Updated Jun 6, 2024

A compact split ortholinear keyboard.

Python 864 182 Updated Nov 20, 2022

Binary Neural Network Framework for FPGA(Differentiable LUT)

C++ 138 21 Updated Aug 14, 2024

rust-cuda working group

65 7 Updated Jun 12, 2019

Embedded language for high-performance array computations

Haskell 902 118 Updated Oct 30, 2024
Next