- University of Electronic Science and Technology of China (UESTC)
- Chengdu, Sichuan, China
Stars
verl: Volcano Engine Reinforcement Learning for LLMs
✨✨Latest Advances on Multimodal Large Language Models
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce (Findings of EMNLP 2021)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized?"
ccks2022 task9 subtask2: identical product identification
[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".
[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
Foundational Models for State-of-the-Art Speech and Text Translation
[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model
A curated list of recent diffusion models for video generation, editing, and various other applications.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
The official implementation of "Divergence of Features and Mean: A BatchNorm-based Abnormality Criterion for Weakly Supervised Video Anomaly Detection"
The official code for "MSFlow: Multi-Scale Normalizing Flows for Unsupervised Anomaly Detection"
The official implementation of ImbSAM (Imbalanced-SAM)
A natural language interface for computers
[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)