yuanjunze (MrFace) / Starred · GitHub
元神

ASIO for Rocksmith 2014

C++ 1,330 111 Updated Jun 12, 2025
Python 12,132 1,101 Updated Jun 21, 2025

Text to 4D Worlds in Blender

Python 87 11 Updated Jun 18, 2025

Operate Blender modeling through the MCP protocol so that an LLM can create 3D models directly, opening a new chapter in 3D modeling

Python 5 2 Updated Mar 21, 2025
Python 41 12 Updated Nov 24, 2023

This is an official implementation for "Video Swin Transformers".

Python 1,560 209 Updated Mar 8, 2023

Bring some peace to your terminal with Cyber Buddha!

1 Updated Dec 30, 2024

[IJCV 2024] InterGen: Diffusion-based Multi-human Motion Generation under Complex Interactions

Python 278 17 Updated Jul 20, 2024

The official PyTorch implementation of L2CS-Net for gaze estimation and tracking

Python 401 94 Updated Feb 2, 2024

Video understanding: Qwen video multimodal model & Dify

Python 60 9 Updated Sep 2, 2024

👑 Qwen Blog.

HTML 64 26 Updated Jun 30, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 11,289 821 Updated May 15, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,027 448 Updated Aug 7, 2024

🚀🚀 "Large models": train a small 26M-parameter GPT entirely from scratch in just 2 hours! 🌏

Python 22,547 2,670 Updated Apr 30, 2025

SOTA Open Source TTS

Python 22,137 1,817 Updated Jun 12, 2025

Mamba SSM architecture

Python 15,233 1,353 Updated Jun 26, 2025

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Python 205 15 Updated Jan 6, 2025

Collection of AWESOME vision-language models for vision tasks

2,805 216 Updated May 25, 2025

High-resolution models for human tasks.

Python 5,061 295 Updated Nov 18, 2024

DORA (Dataflow-Oriented Robotic Architecture) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dat…

Rust 2,295 193 Updated Jul 1, 2025

[NeurIPS2024] Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models"

Python 181 3 Updated May 29, 2025

Tools for Supine.

1 Updated Aug 9, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 53,262 6,531 Updated Jul 1, 2025

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 3,529 413 Updated Jun 30, 2025

Implementation of RT1 (Robotic Transformer) in Pytorch

Python 431 33 Updated Oct 6, 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,856 175 Updated May 26, 2025

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 770 46 Updated Jun 16, 2025

GRUtopia: Dream General Robots in a City at Scale

Python 859 51 Updated Jun 27, 2025

EVE Series: Encoder-Free Vision-Language Models from BAAI

Python 332 8 Updated Mar 1, 2025