-
SWE-bench Public
Forked from SWE-bench/SWE-benchSWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?
Python MIT License UpdatedJun 4, 2025 -
reward-bench Public
Forked from allenai/reward-benchRewardBench: the first evaluation tool for reward models.
Python Apache License 2.0 UpdatedApr 17, 2025 -
bigcodebench Public
Forked from bigcode-project/bigcodebenchBigCodeBench: Benchmarking Code Generation Towards AGI
Python Apache License 2.0 UpdatedMar 31, 2025 -
evalplus Public
Forked from evalplus/evalplusRigourous evaluation of LLM-synthesized code - NeurIPS 2023
Python Apache License 2.0 UpdatedMar 31, 2025 -
LLaMA-Factory Public
Forked from hiyouga/LLaMA-FactoryA WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
-
RM-Bench Public
Forked from THU-KEG/RM-Bench[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
Python UpdatedMar 27, 2025 -
-
LiveCodeBench-AceCoderV2 Public
Forked from LiveCodeBench/LiveCodeBenchOfficial repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
Python MIT License UpdatedMar 27, 2025 -
LiveCodeBench-AceCoder Public
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
Python MIT License UpdatedMar 25, 2025 -
OpenRLHF Public
Forked from OpenRLHF/OpenRLHFAn Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python Apache License 2.0 UpdatedJan 14, 2025 -
apps Public
Forked from hendrycks/appsAPPS: Automated Programming Progress Standard (NeurIPS 2021)
Python MIT License UpdatedNov 18, 2024 -
Qwen2.5-Coder Public
Forked from QwenLM/Qwen2.5-CoderQwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
Python UpdatedNov 11, 2024 -
-
anserini Public
Forked from castorini/anseriniAnserini is a Lucene toolkit for reproducible information retrieval research
Java Apache License 2.0 UpdatedOct 14, 2023 -
-
EncryptionProgram Public
This is the encryption program that I wrote in 10th grade using Java.
Java UpdatedAug 24, 2023 -
-
SP500Prediction Public
This is a MLP model developed by Prof Zhong and Prof Enke. It predicts the daily movement of S&P 500 using 50+ data points. I have made some changes to the model itself given that some data points …
-
OpenSourceFinanceData Public
I will put in my API code for getting open source financial data (ex: daily price, yield, spread, etc). Please note that I do not guarantee the accuracy of the data acquired and you should not make…
Python UpdatedAug 24, 2023 -
AWOL Public
AWOL is a incomplete indie game that was originally designed to be submitted to the Global Game Jam Competition.
UpdatedSep 25, 2022 -
-
CarND-LaneLines-P1 Public
Forked from udacity/CarND-LaneLines-P1Lane Finding Project for Self-Driving Car ND
Jupyter Notebook MIT License UpdatedJul 6, 2022