ling-chun (ling-chun) / Starred · GitHub

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3.

Python 1,246 140 Updated May 18, 2025

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,525 175 Updated Jun 25, 2024

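Currently covers 232 large models, including commercial models such as chatgpt, gpt-4o, o3-mini, Google gemini, Claude3.5, Zhipu GLM-Zero, ERNIE Bot, qwen-max, Baichuan, iFlytek Spark, SenseTime senseChat, and minimax, as well as DeepSeek-R1, qwq-32b, deepseek-v3, qwen2.5, llama3.3, phi-4, glm4, gemma3, mistral, 书生in…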

4,221 175 Updated May 17, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,507 1,431 Updated May 17, 2025

[ICML 2024 Spotlight] Differentially Private Synthetic Data via Foundation Model APIs 2: Text

Python 40 9 Updated Jan 11, 2025

A self-learning tutorial for CUDA High Performance Programming.

JavaScript 632 70 Updated Apr 12, 2025

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 38,842 3,043 Updated May 18, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,266 159 Updated May 16, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,513 7,452 Updated May 18, 2025

The official code for NAACL 2024 paper: $E^5$: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, Exhibit and Extrapolate

Python 12 1 Updated Jun 23, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 18,431 1,870 Updated May 17, 2025
Python 494 40 Updated Jul 26, 2024

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 1,227 99 Updated Apr 14, 2025

A TextRank method based on PageRank that can be applied to Chinese keyword, phrase, and summary extraction; the code is written in Scala.

Scala 130 53 Updated Jul 29, 2020

LLM(😽)

Python 1,666 93 Updated Feb 3, 2025

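Text Content Grapher based on keyinfo extraction by NLP method. Given a document as input, it extracts the key information, structures it, and organizes it into a graph, giving a graph-based presentation of the article's semantic information.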

Python 1,420 360 Updated Oct 20, 2021

Build large-scale task workflows: luigi + job submission + remote targets + environment sandboxing using Docker/Singularity

Python 103 42 Updated May 6, 2025
Python 281 26 Updated Jul 25, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 10,968 1,593 Updated Apr 26, 2025

An awesome repository & A comprehensive survey on interpretability of LLM attention heads.

TeX 349 12 Updated Mar 2, 2025

Docker Images for the Neo4j Graph Database

Shell 351 174 Updated May 16, 2025

An NVIDIA AI Workbench example project for fine-tuning a Nemotron-3 8B model

Jupyter Notebook 49 26 Updated Apr 15, 2024

[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.

Python 402 36 Updated Dec 23, 2024

Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking

Jupyter Notebook 22 Updated Apr 4, 2025

T2Ranking: A large-scale Chinese benchmark for passage ranking.

Python 158 10 Updated Jul 3, 2023

A unified Natural Language Understanding reranker with deep reinforcement learning

Python 3 Updated Oct 7, 2023

This codebase is based on OLTR codebase

Python 6 1 Updated Dec 8, 2023

allRank is a framework for training learning-to-rank neural models based on PyTorch.

Python 933 125 Updated Aug 6, 2024

A deep reinforcement learning approach to search engine ranking (PyTorch). Final Project for UC Berkeley's CS 285: Deep Reinforcement Learning, Decision Making, and Control

TeX 27 9 Updated May 5, 2024