zhyx12 (Vincent Zhang) / Starred · GitHub
  • USTC VIM
  • Hefei

[CVPR 2025 Oral] VGGT: Visual Geometry Grounded Transformer

Python 6,791 681 Updated May 19, 2025

CVPR 2025 - R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning

Python 11 Updated Apr 23, 2025

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 1,536 128 Updated Apr 12, 2025

Variational Animal Motion Embedding - A tool for time series embedding and clustering

Python 26 4 Updated May 16, 2025

Official code for the manuscript "Three-dimensional surface motion capture of multiple freely moving pigs using MAMMAL"

C++ 23 1 Updated Jun 28, 2023
Jupyter Notebook 19 2 Updated Mar 4, 2025

A Python toolbox for analysing body movements across space and time

Python 182 62 Updated May 19, 2025

Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans

Python 5,034 1,710 Updated May 15, 2025

HMMs in PyTorch

Jupyter Notebook 138 30 Updated Mar 20, 2021
Python 90 33 Updated May 14, 2025

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 765 20 Updated May 19, 2025

Modified Python 3.0 implementation of MotionMapper (https://github.com/gordonberman/MotionMapper)

Python 29 15 Updated May 30, 2024

TVBox open-source edition, sharing set-top box software

JavaScript 2,024 192 Updated Apr 29, 2025
Python 227 32 Updated Apr 25, 2025

(Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models.

Vue 1,873 243 Updated Apr 6, 2025

Witness the aha moment of VLM with less than $3.

Python 3,667 285 Updated May 19, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,958 306 Updated May 11, 2025

AIDE: AI-Driven Exploration in the Space of Code. A state-of-the-art machine learning engineering agent that automates AI R&D.

Python 902 123 Updated Apr 18, 2025

🙌 OpenHands: Code Less, Make More

Python 54,425 6,159 Updated May 19, 2025

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 326 1 Updated Mar 5, 2025

[AAAI 2025] The official repository of our paper "Target Semantics Clustering via Text Representations for Robust Universal Domain Adaptation"

Python 5 Updated Apr 14, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.

Python 8,121 613 Updated Apr 27, 2025

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

418 20 Updated May 12, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,934 489 Updated May 18, 2025

[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,279 61 Updated Apr 24, 2025

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,206 153 Updated Feb 16, 2025

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 607 39 Updated Jan 7, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)

Python 6,737 657 Updated May 19, 2025