8000 gimpong (Jinpeng Wang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View gimpong's full-sized avatar
🤓
Learn and try more
🤓
Learn and try more

Block or report gimpong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
JavaScript 36 5 Updated Jun 11, 2025

The code for the paper "Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning" (CVPR'25).

Python 7 1 Updated May 1, 2025

Awesome Agent Training

155 10 Updated Jun 4, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

897 43 Updated Jun 16, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,415 62 Updated Apr 18, 2025

Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision

175 4 Updated Jun 3, 2025

The code for the paper "AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing" (CVPR'25).

Python 4 Updated Apr 9, 2025

A comprehensive collection of process reward models.

92 1 Updated Jun 9, 2025

[ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding

Jupyter Notebook 84 3 Updated Apr 1, 2025

This repository contains the PyTorch implementation of our work at CVPR 2025

Python 7 3 Updated Apr 8, 2025

This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.

118 1 Updated Jun 16, 2025

✨First Open-Source R1-like Video-LLM [2025/02/18]

Python 347 12 Updated Feb 23, 2025

Collection of papers and repos for multimodal chain-of-thought

84 4 Updated Nov 6, 2024

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 770 46 Updated May 14, 2025

✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models

Python 159 7 Updated Dec 26, 2024

Official pytorch repository for "Ambiguity-Restrained Text-Video Representation Learning for Partially Relevant Video Retrieval" (AAAI 2025 Paper)

8 Updated Jan 9, 2025

FastVideo is a unified framework for accelerated video generation.

Python 1,530 104 Updated Jun 16, 2025

AI for Science 论文解读合集(持续更新ing),论文/数据集/教程下载:hyper.ai

1,354 185 Updated Mar 22, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 52,387 6,354 Updated Jun 16, 2025

A collection of vision foundation models unifying understanding and generation.

55 4 Updated Jan 2, 2025

The code for the paper "Efficient Self-Supervised Video Hashing with Selective State Spaces" (AAAI'25).

Python 18 1 Updated Dec 20, 2024

The code for the paper "BoostAdapter: Improving Test-Time Adaptation via Regional Bootstrapping" (NeurIPS'24).

Python 3 Updated Dec 8, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,526 557 Updated Jun 16, 2025

High-performance Image Tokenizers for VAR and AR

Python 272 6 Updated Apr 25, 2025

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

576 28 Updated Jun 3, 2025

[TMLR 2025🔥] A survey for the autoregressive models in vision.

631 19 Updated Jun 13, 2025

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Python 34 1 Updated Nov 10, 2024

A Video Tokenizer Evaluation Dataset

Python 125 8 Updated Jan 13, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,637 78 Updated Feb 11, 2025

[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training

Python 209 7 Updated Mar 20, 2025
Next
0