elderedition

elderedition

Stars

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 146,534 29,543 Updated Jul 5, 2025

atharvabagde / GraphReader

Implementation of GraphReader paper: https://arxiv.org/abs/2406.14550

Python 11 2 Updated Oct 21, 2024

Workday / cpc

Python 11 Updated Jan 16, 2025

THUDM / LongBench

LongBench v2 and LongBench (ACL 25'&24')

Python 920 92 Updated Jan 15, 2025

jsvine / pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 7,955 754 Updated Jun 22, 2025

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 5,622 613 Updated Jul 3, 2025

Alab-NII / 2wikimultihop

Python 109 1 Updated Aug 21, 2023

google-deepmind / narrativeqa

This repository contains the NarrativeQA dataset. It includes the list of documents with Wikipedia summaries, links to full stories, and questions and answers.

Shell 478 67 Updated Apr 15, 2020

StonyBrookNLP / musique

Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022

Python 147 15 Updated Jun 12, 2024

explodinggradients / ragas

Supercharge Your LLM Application Evaluations 🚀

Python 9,811 969 Updated Jul 3, 2025

THUDM / LongWriter

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Python 1,693 168 Updated Jun 24, 2025

yuwvandy / KG-LLM-MDQA

Python 308 31 Updated Feb 23, 2025

wgh136 / PicaComic

A comic app built with Flutter, supporting multiple comic sources.

Dart 8,280 991 Updated Dec 21, 2024

stephenleo / llm-structured-output-benchmarks

Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on tasks like multi-label classification, named entity recognition,…

Python 173 7 Updated Sep 23, 2024

zjunlp / DeepKE

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 4,011 724 Updated Jun 21, 2025

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 26,305 2,723 Updated Jun 23, 2025

zjukg / KG-LLM-Papers

[Paper List] Papers integrating knowledge graphs (KGs) and large language models (LLMs)

1,957 141 Updated Mar 25, 2025

AI4WA / Docs2KG

Docs2KG: A Human-LLM Collaborative Approach to Unified Knowledge Graph Construction from Heterogeneous Documents

Python 309 39 Updated May 21, 2025

chatanywhere / GPT_API_free

Free ChatGPT&DeepSeek API Key，免费ChatGPT&DeepSeek API。免费接入DeepSeek API和GPT4 API，支持 gpt | deepseek | claude | gemini | grok 等排名靠前的常用大模型。

Python 30,763 2,220 Updated Jun 28, 2025

princeton-nlp / AutoCompressors

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

Python 307 26 Updated Sep 9, 2024

infinigence / LVEval

Repository of LV-Eval Benchmark

Python 67 8 Updated Aug 31, 2024

FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Python 10,075 743 Updated Jun 4, 2025

netease-youdao / QAnything

Question and Answer based on Anything.

Python 13,344 1,292 Updated Mar 24, 2025

3DAgentWorld / Toolkit-for-Prompt-Compression

Toolkit for Prompt Compression

Python 268 9 Updated Feb 11, 2025

microsoft / LLMLingua

[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Python 5,235 308 Updated Mar 11, 2025

AlibabaResearch / AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,745 197 Updated Apr 9, 2025

PaddlePaddle / PaddleOCR

Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools,…

Python 51,271 8,394 Updated Jul 5, 2025

myscale / MyScaleDB

A @ClickHouse fork that supports high-performance vector search and full-text search.

C++ 975 61 Updated Feb 5, 2025

dataelement / bisheng

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…

TypeScript 9,030 1,477 Updated Jul 4, 2025

MaoXiaoYuZ / Long-Novel-GPT

该项目包括一个基于 GPT 等大语言模型的长篇小说生成器，同时还有各类小说生成 Prompt 以及教程。我们欢迎社区贡献，持续更新以提供最佳的小说创作体验。

Python 732 139 Updated Apr 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly