8000 lose4578 (Traly) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View lose4578's full-sized avatar

Block or report lose4578

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 3 Updated May 23, 2025

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Python 703 40 Updated Apr 16, 2025

Dream 7B, a large diffusion language model

Python 776 33 Updated Jun 18, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 2,382 156 Updated Jun 17, 2025

A collection of research papers on low-precision training methods

17 1 Updated May 10, 2025

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,468 62 Updated Jun 5, 2025
Python 10 1 Updated Jan 24, 2025

[ICLR 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,506 65 Updated Jun 23, 2025

Official Repo for Open-Reasoner-Zero

Python 1,969 104 Updated Jun 2, 2025

Pretraining code for a large-scale depth-recurrent language model

Python 783 65 Updated Jun 12, 2025

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,162 108 Updated Jan 24, 2025

Witness the aha moment of VLM with less than $3.

Python 3,790 288 Updated May 19, 2025

MM-Eureka V0 also called R1-Multimodal-Journey, Latest version is in MM-Eureka

Python 307 8 Updated Jun 21, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,360 161 Updated Mar 20, 2025

Fully open reproduction of DeepSeek-R1

Python 24,865 2,306 Updated Jun 23, 2025

[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation

Python 738 21 Updated May 23, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,929 1,490 Updated Apr 24, 2025

Large Reasoning Models

Python 804 45 Updated Dec 3, 2024

A debugging and profiling tool that can trace and visualize python code execution

Python 6,718 440 Updated May 25, 2025
Python 17 1 Updated Oct 13, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 15,905 1,853 Updated Dec 25, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,781 81 Updated Aug 15, 2024

LLM101n: Let's build a Storyteller

33,779 1,835 Updated Aug 1, 2024

[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"

Python 210 11 Updated Sep 30, 2024

NO TIME TO SLEEP

Python 649 25 Updated May 26, 2024

[ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear attention mechanism.

Python 101 6 Updated Jun 14, 2024

SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution

Python 122 4 Updated Mar 30, 2024

[AAAI 2025] Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"

Python 486 32 Updated Jan 19, 2025
Python 38 2 Updated Feb 8, 2024

[ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”

Python 121 6 Updated Jan 14, 2025
Next
0