8000 cool-xuan (Yixuan Zhou (周宜暄)) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View cool-xuan's full-sized avatar
🎯
Focusing
🎯
Focusing
  • University of Electronic Science and Technology of China (UESTC)
  • Chengdu, Sichuan, China
  • 02:24 (UTC +08:00)

Block or report cool-xuan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,839 1,618 Updated Jun 23, 2025

✨✨Latest Advances on Multimodal Large Language Models

15,609 1,015 Updated Jun 19, 2025

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

2,431 112 Updated Jun 20, 2025

K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce (Findings of EMNLP 2021)

Python 31 3 Updated Jan 6, 2023
Python 87 27 Updated Sep 15, 2020

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 52,811 6,466 Updated Jun 23, 2025

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,852 175 Updated May 26, 2025

Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "

Python 120 6 Updated Apr 8, 2025

LLM101n: Let's build a Storyteller

33,776 1,835 Updated Aug 1, 2024
Python 3,945 371 Updated Jun 13, 2025
Python 44 11 Updated Jun 8, 2022

ccks2022 task9 subtask2 商品同款识别

Jupyter Notebook 43 13 Updated Feb 9, 2023

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Python 380 34 Updated May 8, 2025

[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".

Python 280 12 Updated Jun 13, 2024

[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

Python 103 3 Updated Dec 10, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,564 1,141 Updated Nov 14, 2024

[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model

Python 331 17 Updated Nov 4, 2024

A curated list of recent diffusion models for video generation, editing, and various other applications.

4,541 274 Updated Jun 19, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 22,860 2,526 Updated Aug 12, 2024

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 915 41 Updated Sep 27, 2024

The official implementation of "Divergence of Features and Mean: A BatchNorm-based Abnormality Criterion for Weakly Supervised Video Anomaly Detection"

Python 66 15 Updated Nov 30, 2023

The official code for "MSFlow: Multi-Scale Normalizing Flows for Unsupervised Anomaly Detection"

Python 69 11 Updated Mar 8, 2024

The offical implement of ImbSAM (Imbalanced-SAM)

Python 24 2 Updated Mar 4, 2024

A natural language interface for computers

Python 59,733 5,084 Updated Apr 23, 2025

[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models

Python 960 121 Updated Dec 20, 2023

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Python 21,666 2,201 Updated Apr 29, 2025

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

Python 1,945 138 Updated Dec 20, 2023

🕸️ Web apps in pure Python 🐍

Python 23,417 1,371 Updated Jun 23, 2025
Next
0