[go: up one dir, main page]

Skip to content
View simonguozirui's full-sized avatar

Organizations

@worldaffairsconference

Block or report simonguozirui

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.

Python 33 3 Updated Sep 23, 2024

Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.

Python 115 8 Updated Oct 22, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,546 158 Updated Aug 17, 2024

Simple, flexible configuration in pure Python!

Python 3 2 Updated Oct 26, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 30,826 3,665 Updated Nov 3, 2024

Hydragen: High-Throughput LLM Inference with Shared Prefixes

Python 22 2 Updated May 10, 2024
Python 17 5 Updated Sep 30, 2024
Python 1,203 174 Updated Oct 17, 2024
Python 50 2 Updated Oct 30, 2024

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…

2,687 307 Updated Aug 14, 2024

LLM training in simple, raw C/CUDA

Cuda 24,298 2,738 Updated Oct 2, 2024

A framework for deploying on-demand distributed-trust.

C++ 13 2 Updated Jun 4, 2024

Flash Attention in raw Cuda C beating PyTorch

Cuda 13 1 Updated May 14, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 29,622 4,468 Updated Nov 5, 2024

Making data lake work for time series

Python 1,136 60 Updated Aug 21, 2024

A natural language interface for computers

Python 54,899 4,787 Updated Nov 5, 2024

Building blocks for foundation models.

384 15 Updated Jan 3, 2024

A list of ICs and IPs for AI, Machine Learning and Deep Learning.

PHP 1,636 275 Updated Jun 5, 2024

Fast and memory-efficient exact attention

Python 14,049 1,309 Updated Nov 5, 2024

Toy Gaussian Splatting visualization in Unity

C# 2,175 245 Updated Oct 10, 2024

A demo pipeline of using Redis as an online feature store with Feast for orchestration and Ray for training and model serving

Python 10 Updated Oct 31, 2022

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 30,401 2,789 Updated Nov 5, 2024

JAX - A curated list of resources https://github.com/google/jax

1,534 132 Updated Jul 10, 2024

A high-performance C++ library for randomized numerical linear algebra

C++ 60 6 Updated Oct 31, 2024

SLAM performance evaluation framework

C++ 314 84 Updated Apr 2, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 14,612 2,922 Updated Nov 5, 2024

Development repository for the Triton language and compiler

C++ 13,298 1,628 Updated Nov 5, 2024

macOS system monitor in your menu bar

Swift 25,582 846 Updated Oct 28, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 33,752 5,740 Updated Nov 5, 2024
Next