[go: up one dir, main page]

Skip to content
View ZIYU-DEEP's full-sized avatar

Sponsoring

@lilianweng
@teknium1
@camel-ai

Organizations

@googlers

Block or report ZIYU-DEEP

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

Learning Formal Mathematics from Intrinsic Motivation

Rust 7 4 Updated Nov 1, 2024
Python 1 Updated Oct 25, 2024

Path-SGD: Path-Normalized Optimization in Deep Neural Networks

Python 19 4 Updated Nov 26, 2018

Computing various measures and generalization bounds on convolutional and fully connected networks

Python 35 9 Updated Dec 13, 2018

PyTorch native quantization and sparsity for training and inference

Python 1,539 160 Updated Nov 6, 2024
Python 9 Updated Aug 6, 2024

A compact LLM pretrained in 9 days by using high quality data

Python 258 19 Updated Sep 22, 2024

Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation

Python 152 32 Updated Nov 3, 2024

The Art of Debugging

C 810 34 Updated Aug 3, 2024

Machine Learning Engineering Open Book

Python 11,577 703 Updated Nov 1, 2024

This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.

Python 109 9 Updated Apr 25, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 8,052 1,119 Updated Oct 25, 2024

A community-maintained Python framework for creating mathematical animations.

Python 25,799 1,773 Updated Nov 4, 2024

Animation engine for explanatory math videos

Python 70,145 6,169 Updated Oct 27, 2024

Code for the paper "Evolved Policy Gradients"

Python 249 55 Updated Nov 22, 2018

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

Python 121 21 Updated Oct 24, 2024

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Python 280 24 Updated Nov 4, 2024

Spectral Representation for Causal Estimation with Hidden Confounders

Python 3 Updated May 27, 2024

Code for Quiet-STaR

Python 632 88 Updated Aug 21, 2024

Learning from synthetic data - code and models

Python 301 13 Updated Jan 6, 2024

Sakura-SOLAR-DPO: Merge, SFT, and DPO

Python 115 7 Updated Dec 30, 2023

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 2 Updated Oct 12, 2024

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Python 475 53 Updated Nov 5, 2024

This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to le…

Python 83 16 Updated Jun 11, 2021

This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.

431 25 Updated Oct 28, 2024

Framework for building data agent workflows

Python 78 10 Updated Aug 20, 2024

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 1,607 127 Updated Nov 5, 2024

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 275 8 Updated Jul 15, 2024

PyTorch native finetuning library

Python 4,257 419 Updated Nov 5, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 18,503 1,425 Updated Nov 6, 2024
Next