-
-
3090_shorts Public
minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever
-
ai-toolkit Public
Forked from ostris/ai-toolkitVarious AI scripts. Mostly Stable Diffusion stuff.
Python MIT License UpdatedAug 23, 2024 -
llm-datasets Public
Forked from mlabonne/llm-datasetsHigh-quality datasets, tools, and concepts for LLM fine-tuning.
UpdatedMay 10, 2024 -
fsdp_qlora Public
Forked from AnswerDotAI/fsdp_qloraTraining LLMs with QLoRA + FSDP
Jupyter Notebook Apache License 2.0 UpdatedApr 25, 2024 -
-
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedMar 29, 2024 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedMar 26, 2024 -
EQ-Bench Public
Forked from EQ-bench/EQ-BenchA benchmark for emotional intelligence in large language models
Python MIT License UpdatedMar 15, 2024 -
-
-
Latte Public
Forked from Vchitect/LatteThe official implementation of Latte: Latent Diffusion Transformer for Video Generation.
Python MIT License UpdatedFeb 20, 2024 -
accelerate Public
Forked from huggingface/accelerate🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Python Apache License 2.0 UpdatedFeb 13, 2024 -
-
-
LLMTest_NeedleInAHaystack Public
Forked from gkamradt/LLMTest_NeedleInAHaystackDoing simple retrieval from LLM models at various context lengths to measure accuracy
Jupyter Notebook Other UpdatedJan 21, 2024 -
deep_4_all Public
Forked from blancsw/deep_4_allCourses a codes that I use to teach deeplearing
Python UpdatedJan 3, 2024 -
-
train-mamba-with-fsdp Public
Forked from abacaj/fine-tune-mistral -
-
-
-
-
-
-
-
-
-
FastChat Public
Forked from lm-sys/FastChatAn open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Python Apache License 2.0 UpdatedAug 24, 2023