8000 yqy2001 / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View yqy2001's full-sized avatar

Organizations

@baaivision

Block or report yqy2001

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Releases from OpenAI Preparedness

Python 761 74 Updated May 30, 2025
Python 8 Updated Mar 18, 2025

Estimate MFU for DeepSeekV3

Python 24 1 Updated Jan 5, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 372 20 Updated May 29, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,196 180 Updated May 30, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,280 52 Updated May 11, 2025

NanoGPT (124M) in 3 minutes

Python 2,601 311 Updated May 27, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,396 606 Updated May 27, 2025

[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"

Python 477 56 Updated Apr 5, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,811 1,100 Updated May 31, 2025

Large Concept Models: Language modeling in a sentence representation space

Python 2,211 201 Updated Jan 29, 2025

Efficient Triton Kernels for LLM Training

Python 5,121 341 Updated May 31, 2025

The official repository of the Omni-MATH benchmark.

Python 83 1 Updated Dec 22, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,775 135 Updated Jan 17, 2025

Solutions of Reinforcement Learning, An Introduction

Jupyter Notebook 2,228 491 Updated May 20, 2024

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 9,259 1,027 Updated May 28, 2025

An educational resource to help anyone learn deep reinforcement learning.

Python 10,922 2,340 Updated Aug 5, 2024

Python Implementation of Reinforcement Learning: An Introduction

Python 14,109 4,921 Updated Aug 9, 2024

Next-Token Prediction is All You Need

Python 2,135 80 Updated Mar 17, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,741 375 Updated May 29, 2025

DSIR large-scale data selection framework for language model training

Python 249 19 Updated Apr 7, 2024

Library for fast text representation and classification.

HTML 26,235 4,768 Updated Mar 22, 2024

DataComp for Language Models

HTML 1,304 118 Updated Mar 19, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 770 68 Updated Mar 14, 2025

[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

Python 368 24 Updated Sep 6, 2024

[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Python 353 29 Updated Sep 6, 2024

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1,418 103 Updated May 28, 2023
Next
0