anxiangsir (Xiang An) / Starred · GitHub

Official PyTorch implementation of EMOVA in CVPR 2025 (https://arxiv.org/abs/2409.18042)

Python 55 6 Updated Mar 16, 2025

Official code repo for our work "Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models"

Python 27 Updated Jun 17, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,628 262 Updated Jun 18, 2025

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 1,626 129 Updated Jun 27, 2025

[ICML'25] Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models

Python 8 Updated Jun 9, 2025

PyTorch implementation of Zero-Shot Vision Encoder Grafting via LLM Surrogates [ICCV 2025]

Python 42 1 Updated Jun 25, 2025

Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models"

Python 69 3 Updated Jun 2, 2025

PDF textbooks for all primary, middle, and high school levels and for university.

Roff 41,872 9,328 Updated May 18, 2025

Processed / Cleaned Data for Paper Copilot

Python 508 18 Updated Jun 24, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 91,112 24,546 Updated Jun 29, 2025

An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 797 29 Updated Jun 16, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,270 47 Updated Jun 14, 2025

[CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception

Python 62 3 Updated Jun 10, 2025

GIF: Generative Inspiration for Face Recognition at Scale

12 Updated May 7, 2025

[ECCV2020] A Large-Scale Face Anti-Spoofing Dataset

Python 569 94 Updated Feb 26, 2021

[TPAMI] Searching prompt modules for parameter-efficient transfer learning.

Python 232 11 Updated Dec 8, 2023

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 3,583 320 Updated Jun 27, 2025

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,477 64 Updated Jun 5, 2025
Python 72 6 Updated May 4, 2025

The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs"

Python 75 2 Updated May 19, 2025

Official PyTorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]

Python 13 1 Updated May 1, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,346 76 Updated May 28, 2025
Python 15 5 Updated Apr 3, 2025

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Python 691 27 Updated Apr 20, 2025
Python 9 Updated Jun 21, 2025

PyTorch implementation of "UNIT: Unifying Image and Text Recognition in One Vision Encoder", NeurIPS 2024.

Python 30 2 Updated Sep 26, 2024

A free open source IT asset/license management system

PHP 12,376 3,444 Updated Jun 27, 2025

An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowe…

Jupyter Notebook 2,480 227 Updated May 7, 2025

⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.

Python 168 10 Updated Jun 10, 2025