8000 THUCSTHanxu13 (SillyXu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View THUCSTHanxu13's full-sized avatar
  • Tsinghua University
  • Beijing, China

Organizations

@thunlp

Block or report THUCSTHanxu13

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2025 HIghlight] XLRS-Bench: ould Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?

39 Updated May 30, 2025

(ACL 2025 main) FR-Spec: Frequency-Ranked Speculative Sampling

C++ 26 1 Updated May 30, 2025

Build & Optimize your RAG.

Python 670 50 Updated May 13, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,587 91 Updated Mar 18, 2025
Python 13 Updated Oct 3, 2024

An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.

Python 720 74 Updated Oct 20, 2024

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Python 358 38 Updated Apr 20, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,370 464 Updated Nov 6, 2024

[ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.

Python 151 9 Updated Jul 17, 2024

Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)

Python 106 9 Updated Mar 20, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,507 1,408 Updated May 27, 2025

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Python 326 28 Updated Sep 25, 2024

百亿参数的中英文双语基座大模型

Python 2,434 191 Updated Jul 28, 2023

The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。

JavaScript 49,485 9,986 Updated Apr 2, 2025

Full description can be found here: https://discuss.huggingface.co/t/pretrain-gpt-neo-for-open-source-github-copilot-model/7678?u=ncoop57

Python 3,291 219 Updated Jan 18, 2022

Prompt Tuning with Rules

Python 159 29 Updated Sep 4, 2022

Live Training for Open-source Big Models

Python 506 40 Updated May 30, 2023

A List of Big Models

Python 343 14 Updated Jun 30, 2023

Model Compression for Big Models

Python 162 23 Updated Jun 30, 2023

Efficient Training (including pre-training and fine-tuning) for Big Models

Python 592 80 Updated May 29, 2025

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Python 256 29 Updated Nov 27, 2023

Efficient Inference for Big Models

Python 584 66 Updated Jan 24, 2023

Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines and results.

Python 142 27 Updated Jul 19, 2023

The dataset for the 2019 Cuneiform Language Identification (CLI) shared task

3 Updated Jul 8, 2019

PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.

Jupyter Notebook 1,710 328 Updated May 17, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,650 4,401 Updated May 31, 2025

Tools for state of the art Knowledge Base Completion.

Python 254 39 Updated Aug 16, 2021

S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/

Python 934 70 Updated Apr 26, 2024
Python 3 Updated Apr 10, 2020

Code associated with the Don't Stop Pretraining ACL 2020 paper

Python 531 73 Updated Nov 15, 2021
Next
0