rongzhimd / Starred · GitHub

✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

Python 578 47 Updated May 24, 2025

✨✨Latest Advances on Multimodal Large Language Models

15,528 1,008 Updated Jun 13, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 770 46 Updated May 14, 2025

A live stream development of RL tuning for LLM agents

Python 2,986 414 Updated May 23, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,137 312 Updated May 11, 2025

A fork to add multimodal model training to open-r1

Python 1,297 63 Updated Feb 8, 2025

✨First Open-Source R1-like Video-LLM [2025/02/18]

Python 346 12 Updated Feb 23, 2025

Adds Sequence Parallelism to LLaMA-Factory

Python 509 32 Updated Jun 9, 2025

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 2,515 392 Updated Jun 13, 2025

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

564 21 Updated May 8, 2025
Python 37 2 Updated Jul 9, 2024

Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)

Python 336 17 Updated Feb 23, 2024

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python 568 41 Updated May 8, 2024

A summary of Prompt & LLM papers, open-source data & models, and AIGC applications

3,075 302 Updated Jun 9, 2025

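MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, aiming to match the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing stories, chat logs, and more.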

3,878 275 Updated Jun 9, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 109,364 17,793 Updated Jun 13, 2025

Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"

47 1 Updated Oct 21, 2023

LongBench v2 and LongBench (ACL 2024)

Python 895 88 Updated Jan 15, 2025

LongQLoRA: Extend Context Length of LLMs Efficiently

Python 166 15 Updated Nov 12, 2023

SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese

3,198 107 Updated Apr 28, 2025

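Llama Chinese community: aggregates the latest Llama learning resources in real time and builds the best open-source ecosystem for Chinese Llama large models; fully open source and available for commercial use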

Python 14,608 1,304 Updated Apr 6, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 52,204 6,300 Updated Jun 12, 2025

A series of large language models developed by Baichuan Intelligent Technology

Python 4,123 295 Updated Nov 8, 2024

Free and Open Source, Distributed, RESTful Search Engine

Java 72,939 25,254 Updated Jun 13, 2025

BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational large model)

HTML 8,169 767 Updated Oct 16, 2024

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

Python 637 30 Updated Dec 23, 2024

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,731 497 Updated May 31, 2024

Tracking Anything in High Quality

Python 752 62 Updated Dec 1, 2023

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Python 2,182 167 Updated Dec 22, 2022

[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.

Python 897 124 Updated Jul 18, 2023