cihangxie (Cihang Xie) / Starred · GitHub

Organizations

@ccvl

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 924 24 Updated May 15, 2025

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Python 223 12 Updated May 15, 2025

Fully open data curation for reasoning models

Python 1,778 148 Updated May 9, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,315 1,405 Updated May 16, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,050 166 Updated May 14, 2025

Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark

Python 16 1 Updated Apr 22, 2025

Let's make video diffusion practical!

Python 13,286 1,137 Updated May 4, 2025

MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs

Python 9 2 Updated Apr 12, 2025
Python 21 Updated Apr 7, 2025

MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs

Python 158 15 Updated Apr 8, 2025

m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models

Jupyter Notebook 27 2 Updated Apr 14, 2025
Python 4 Updated Mar 18, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,551 834 Updated Apr 29, 2025

This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation"

Python 206 8 Updated Apr 30, 2025

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Python 107 1 Updated Apr 24, 2025

Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More

Python 20 Updated Feb 25, 2025

Pytorch implementation of EpiFoundation

Python 17 Updated Feb 25, 2025

A Training-free Iterative Framework for Long Story Visualization

Python 889 125 Updated Jan 18, 2025

“FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching”. FlowAR employs the simplest scale design and is compatible with any VAE.

Python 116 1 Updated May 1, 2025

Large Context Attention

Python 709 53 Updated Jan 24, 2025

Official inference framework for 1-bit LLMs

Python 19,627 1,455 Updated May 19, 2025
Python 21 Updated Feb 27, 2025
Python 14 3 Updated Oct 14, 2024
Python 48 2 Updated Feb 26, 2025

Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges

Python 68 2 Updated Feb 27, 2025

[ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine"

Python 329 21 Updated Feb 26, 2025

Official inference repo for FLUX.1 models

Python 21,706 1,541 Updated Feb 6, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 15,474 1,745 Updated Dec 25, 2024