8000 Debrup-61 (Debrup Das) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Debrup-61's full-sized avatar

Block or report Debrup-61

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Simple RL training for reasoning

Python 3,649 272 Updated Apr 10, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,704 204 Updated Jun 20, 2025

A topic-centric list of HQ open datasets.

63,515 10,140 Updated Nov 13, 2024

Awesome-LLM: a curated list of Large Language Model

23,994 2,022 Updated May 9, 2025

The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models. Paper: https://arxiv.org/abs/2402.01620

Python 35 7 Updated Feb 5, 2024

The official Meta Llama 3 GitHub site

Python 28,805 3,405 Updated Jan 26, 2025

Code base for paper "Zero-Shot Cross-Lingual Transfer with Meta Learning"

Python 34 3 Updated Nov 8, 2024
0