-
SelfCorrectionLanguageModelTraining Public
Forked from MeNicefellow/SelfCorrectionLanguageModelTrainingPython UpdatedSep 30, 2024 -
uvadlc_notebooks Public
Forked from phlippe/uvadlc_notebooksRepository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2022/Spring 2022
Jupyter Notebook MIT License UpdatedSep 29, 2024 -
Grounding_LLMs_with_online_RL Public
Forked from flowersteam/Grounding_LLMs_with_online_RLWe perform functional grounding of LLMs' knowledge in BabyAI-Text
Python MIT License UpdatedSep 28, 2024 -
-
dart-math Public
Forked from hkust-nlp/dart-math[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
Jupyter Notebook MIT License UpdatedSep 26, 2024 -
-
-
-
Super_MARIO Public
Forked from MARIO-Math-Reasoning/Super_MARIOPython MIT License UpdatedSep 14, 2024 -
-
DeepCubeAI Public
Forked from misaghsoltani/DeepCubeAILearning Discrete World Models for Heuristic Search
Python MIT License UpdatedSep 7, 2024 -
loss-of-plasticity Public
Forked from shibhansh/loss-of-plasticityDemonstrations of Loss of Plasticity and Implementation of Continual Backpropagation
Python MIT License UpdatedAug 21, 2024 -
quiet-star Public
Forked from ezelikman/quiet-starCode for Quiet-STaR
Python Apache License 2.0 UpdatedAug 21, 2024 -
-
search-agents Public
Forked from kohjingyu/search-agentsCode for the paper 🌳 Tree Search for Language Model Agents
Python MIT License UpdatedJul 25, 2024 -
cultural-accumulation Public
Forked from FLAIROx/cultural-accumulationJupyter Notebook UpdatedJul 16, 2024 -
-
grokfast Public
Forked from ironjr/grokfastOfficial repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"
Python MIT License UpdatedJun 24, 2024 -
GrokkedTransformer Public
Forked from OSU-NLP-Group/GrokkedTransformerCode for the paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
Python MIT License UpdatedJun 22, 2024 -
LightZero Public
Forked from opendilab/LightZero[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Python Apache License 2.0 UpdatedJun 20, 2024 -
torax Public
Forked from google-deepmind/toraxTORAX: Tokamak transport simulation in JAX
Python Other UpdatedJun 12, 2024 -
llm.c Public
Forked from karpathy/llm.cLLM training in simple, raw C/CUDA
Cuda MIT License UpdatedJun 10, 2024 -
LAPO Public
Forked from schmidtdominik/LAPOCode for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)
Python UpdatedApr 14, 2024 -
TheArtofHPC_pdfs Public
Forked from VictorEijkhout/TheArtofHPC_pdfsAll pdfs of Victor Eijkhout's Art of HPC books and courses
UpdatedApr 12, 2024 -
-
backtrader_moexalgo Public
Forked from WISEPLAT/backtrader_moexalgoMOEX API AlgoPack integration with Backtrader. На данных с биржи MOEX теперь можно создавать полноценные торговые стратегии. Проводить Backtesting и делать Live торговлю через брокеров Алор, Финам …
Python MIT License UpdatedJan 31, 2024 -
alphageometry Public
Forked from google-deepmind/alphageometryPython Apache License 2.0 UpdatedJan 17, 2024 -
EfficientZero Public
Forked from YeWR/EfficientZeroOpen-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
Python GNU General Public License v3.0 UpdatedDec 20, 2023 -
DIFUSCO Public
Forked from Edward-Sun/DIFUSCOCode of NeurIPS paper: arxiv.org/abs/2302.08224
Python MIT License UpdatedDec 7, 2023 -
metasploit-framework Public
Forked from rapid7/metasploit-frameworkMetasploit Framework
Ruby Other UpdatedDec 1, 2023