JialianW (Jialian Wu) / Starred · GitHub
Python 17 1 Updated Mar 7, 2025

Fully Open Language Models with Stellar Performance

Python 228 20 Updated May 7, 2025
Python 8 Updated Feb 25, 2025

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 4,469 645 Updated Mar 27, 2025

Efficient Triton Kernels for LLM Training

Python 5,123 341 Updated Jun 1, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,908 132 Updated Oct 30, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,465 539 Updated May 30, 2025

Large World Model -- Modeling Text and Video with Millions Context

Python 7,277 557 Updated Oct 19, 2024

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,708 279 Updated Jan 16, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,018 56 Updated May 29, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance

Python 8,231 626 Updated May 29, 2025

[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA), links for downloading the trained model checkpoints, and example notebooks / gra…

Python 224 10 Updated Sep 30, 2024
Python 8,624 506 Updated Oct 9, 2024

Promptable GRiT: support inference with both automatic proposal generation and custom point/box prompts.

Python 4 Updated Nov 28, 2023

[arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"

Python 129 10 Updated Jul 31, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,461 850 Updated Jun 10, 2024

A curated list of deep learning resources for video-text retrieval.

621 67 Updated Oct 20, 2023

Multi-modality pre-training

Python 492 37 Updated May 8, 2024

[NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Jupyter Notebook 278 17 Updated Mar 14, 2024

Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'

Python 33 2 Updated Nov 7, 2023

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Python 1,920 113 Updated Nov 30, 2023

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Python 949 131 Updated Apr 12, 2024

Large-scale text-video dataset. 10 million captioned short videos.

Python 636 40 Updated Aug 14, 2024

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]

Python 360 44 Updated May 19, 2022

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

Python 621 41 Updated Jan 29, 2025

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,245 264 Updated Jan 18, 2025

✨✨Latest Advances on Multimodal Large Language Models

15,409 997 Updated May 30, 2025

Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"

Python 266 14 Updated Jun 12, 2024

Code release for "Learning Video Representations from Large Language Models"

Python 520 46 Updated Oct 1, 2023

[NeurIPS 2022] Egocentric Video-Language Pretraining

Python 241 20 Updated May 9, 2024