Stars
Dense Passage Retriever: a set of tools and models for open-domain Q&A tasks.
Repository for "Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions" (ACL 2023)
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain with language models such as ChatGLM, Qwen, and Llama; a local-knowledge-based LLM application.
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
Conversion between Traditional and Simplified Chinese
Ongoing research training transformer models at scale
FlashMLA: Efficient MLA decoding kernels
Official Repo for Open-Reasoner-Zero
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"
Can Knowledge Editing Really Correct Hallucinations? (ICLR 2025)
Library for Knowledge Intensive Language Tasks
A high-throughput and memory-efficient inference and serving engine for LLMs (see the usage sketch after this list)
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
EMNLP 2021: "Simple Entity-centric Questions Challenge Dense Retrievers" (https://arxiv.org/abs/2109.08535)
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
🧑🚀 A curated summary of the world's best LLM resources (video generation, agents, coding assistance, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).
Official repo for the paper "DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning".
Fast and memory-efficient exact attention
A high-accuracy and efficient multi-task fine-tuning framework for Code LLMs; accepted at KDD 2024.
[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO
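For the vLLM entry above (the high-throughput inference and serving engine), here is a minimal offline-inference sketch using vLLM's Python API. The model id, prompt, and sampling settings are illustrative placeholders and are not prescribed by any repository in this list.

```python
# Minimal sketch: offline batch inference with vLLM's Python API.
# Model id, prompt, and sampling settings are illustrative placeholders.
from vllm import LLM, SamplingParams

prompts = ["What is dense passage retrieval?"]
sampling_params = SamplingParams(temperature=0.7, max_tokens=128)

llm = LLM(model="facebook/opt-125m")  # any Hugging Face-compatible model id
outputs = llm.generate(prompts, sampling_params)

for output in outputs:
    # Each result holds the prompt and its generated completion(s).
    print(output.outputs[0].text)
```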