8000 CRyan2016 / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View CRyan2016's full-sized avatar

Block or report CRyan2016

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

a-m-team's exploration in large language modeling

99 2 Updated May 14, 2025

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

762 53 Updated May 8, 2024

List of papers on hallucination detection in LLMs.

863 71 Updated May 8, 2025

Efficient Triton Kernels for LLM Training

Python 5,017 324 Updated May 15, 2025

An AI web browsing framework focused on simplicity and extensibility.

TypeScript 11,822 650 Updated May 15, 2025

A collection of MCP servers.

49,022 3,655 Updated May 14, 2025

DeepRetrieval - Hacking 🔥Real Search Engines and Retrievers with LLM via RL

Python 491 67 Updated May 6, 2025

Parsing-free RAG supported by VLMs

Python 701 56 Updated Feb 19, 2025

Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS

Python 1,175 108 Updated Mar 28, 2025

[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!

Python 699 61 Updated Mar 17, 2025

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,697 199 Updated May 12, 2025

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 33,485 2,692 Updated May 14, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)

Python 6,702 654 Updated May 15, 2025

An Open Large Reasoning Model for Real-World Solutions

Python 1,488 80 Updated Mar 4, 2025

Moxin is a family of fully open-source and reproducible LLMs

Python 123 7 Updated Apr 24, 2025

Original source code The Art of Reinforcement Learning by Michael Hu

Python 25 13 Updated Jul 12, 2024

90% of what you need for LLM app development. Nothing you don't.

Python 260 22 Updated Apr 23, 2025

A library of reasoning algorithms for agents

Python 248 14 Updated Apr 6, 2025

Retrieval and Retrieval-augmented LLMs

Python 9,627 699 Updated May 15, 2025

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

1,202 74 Updated Feb 24, 2025

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 3,937 302 Updated Aug 10, 2024

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Python 2,534 211 Updated Apr 1, 2025

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 8,923 900 Updated May 4, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,726 374 Updated May 13, 2025

Implementing the 4 agentic patterns from scratch

Jupyter Notebook 1,294 195 Updated Mar 18, 2025

FinRL®: Financial Reinforcement Learning. 🔥

Jupyter Notebook 11,687 2,736 Updated May 5, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 7,447 819 Updated Apr 30, 2025

Controllable Text Generation for Large Language Models: A Survey

TeX 174 9 Updated Aug 27, 2024

Build Better Websites. Create modern, resilient user experiences with web fundamentals.

TypeScript 31,230 2,636 Updated May 13, 2025

Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.

Python 1,056 92 Updated Mar 21, 2025
Next
0