Starred repositories
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
Official PyTorch implementation for "Large Language Diffusion Models"
A collection of research papers on low-precision training methods
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
[ICLR 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.
Official Repo for Open-Reasoner-Zero
Pretraining code for a large-scale depth-recurrent language model
Training Large Language Model to Reason in a Continuous Latent Space
Witness the aha moment of VLM with less than $3.
MM-Eureka V0 (also called R1-Multimodal-Journey); the latest version is in MM-Eureka
Fully open reproduction of DeepSeek-R1
[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation
Minimal reproduction of DeepSeek R1-Zero
A debugging and profiling tool that can trace and visualize python code execution
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"
[ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear attention mechanism.
SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution
[AAAI 2025] Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"
[ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”