8000 MIracleyin's list / LLM-eval · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View MIracleyin's full-sized avatar
  • Tencent
  • China
  • 11:42 (UTC +08:00)

Block or report MIracleyin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

LLM-eval

eval tools for LLM
6 repositories

Code for the paper "Evaluating Large Language Models Trai 5C32 ned on Code"

Python 2,761 388 Updated Jan 17, 2025

CMMLU: Measuring massive multitask language understanding in Chinese

Python 763 62 Updated Dec 6, 2024

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Python 1,733 81 Updated Oct 26, 2023

A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark

Python 29 42 Updated May 22, 2025

A framework for few-shot evaluation of language models.

Python 9,039 2,417 Updated May 25, 2025

Do Multilingual Language Models Think Better in English?

Jupyter Notebook 41 5 Updated Aug 3, 2023
0