ybrrraway (hyb) / Starred · GitHub

Pixel-Level Reasoning Model trained with RL

Python 145 2 Updated Jun 24, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 1,266 67 Updated Jun 25, 2025

Reinforcement learning resources curated

9,157 1,836 Updated May 25, 2023

HoPE: Hybrid of Position Embedding for Length Generalization in Vision-Language Models

7 Updated Jun 8, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,597 564 Updated Jun 27, 2025

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,477 63 Updated Jun 5, 2025

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 1,217 131 Updated Jun 26, 2025

A powerful tool for creating fine-tuning datasets for LLM

JavaScript 8,897 861 Updated Jun 25, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 9,641 686 Updated Jun 25, 2025

[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding

Python 50 2 Updated Dec 13, 2024

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Python 412 19 Updated Oct 16, 2024

MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, realistic, and adaptive scene generation for applications in…

Python 146 6 Updated May 5, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,054 1,655 Updated Jun 27, 2025
Python 718 47 Updated May 30, 2025

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

3,178 186 Updated May 7, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 146,116 29,469 Updated Jun 27, 2025

Witness the aha moment of VLM with less than $3.

Python 3,806 289 Updated May 19, 2025

Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄); read online at https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 11,703 2,053 Updated Jun 19, 2025

🚀🚀 [LLM] Train a small 26M-parameter GPT entirely from scratch in just 2 hours! 🌏

Python 22,447 2,653 Updated Apr 30, 2025

🍒 Cherry Studio is a desktop client that supports multiple LLM providers.

TypeScript 29,075 2,526 Updated Jun 27, 2025

[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'

Python 221 10 Updated Apr 20, 2025

[ICML 2025 Oral] An official implementation of VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Python 159 5 Updated Jun 16, 2025

Fully open data curation for reasoning models

Python 1,948 165 Updated Jun 5, 2025

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

448 20 Updated Jun 5, 2025
Python 3,958 373 Updated Jun 13, 2025

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 2,593 418 Updated Jun 27, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 11,207 812 Updated May 15, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model with performance approaching GPT-4o.

Python 8,424 648 Updated May 29, 2025

[EMNLP 2024🔥] Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,289 235 Updated Dec 3, 2024

SAM with text prompt

Python 2,257 260 Updated May 10, 2025