-
The University of Chicago
- https://hazelye-bot.github.io/
- in/ziyuye
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Path-SGD: Path-Normalized Optimization in Deep Neural Networks
Computing various measures and generalization bounds on convolutional and fully connected networks
PyTorch native quantization and sparsity for training and inference
A compact LLM pretrained in 9 days by using high quality data
Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation
Machine Learning Engineering Open Book
This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
A community-maintained Python framework for creating mathematical animations.
The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]
LiveBench: A Challenging, Contamination-Free LLM Benchmark
Spectral Representation for Causal Estimation with Hidden Confounders
Learning from synthetic data - code and models
Sakura-SOLAR-DPO: Merge, SFT, and DPO
yangky11 / LLaMA-Factory
Forked from hiyouga/LLaMA-FactoryA WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to le…
This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.
Framework for building data agent workflows
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
DSPy: The framework for programming—not prompting—foundation models