JialianW (Jialian Wu) / Starred · GitHub

Jialian Wu JialianW

90 followers · 16 following

San Diego
jialianwu.com

Stars

AMD-AIG-AIMA / InstellaVL

Python 17 1 Updated Mar 7, 2025

AMD-AIG-AIMA / Instella

Fully Open Language Models with Stellar Performance

Python 228 20 Updated May 7, 2025

EvanZhuang / AgenticLU

Python 8 Updated Feb 25, 2025

SamuelSchmidgall / AgentLaboratory

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 4,469 645 Updated Mar 27, 2025

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 5,123 341 Updated Jun 1, 2025

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,908 132 Updated Oct 30, 2024

google / gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Python 5,465 539 Updated May 30, 2025

LargeWorldModel / LWM

Large World Model -- Modeling Text and Video with Millions Context

Python 7,277 557 Updated Oct 19, 2024

deepseek-ai / DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,708 279 Updated Jan 16, 2024

Vchitect / VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,018 56 Updated May 29, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance

Python 8,231 626 Updated May 29, 2025

xk-huang / segment-caption-anything

[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloading the trained model checkpoints, and example notebooks / gra…

Python 224 10 Updated Sep 30, 2024

apple / ml-ferret

Python 8,624 506 Updated Oct 9, 2024

xk-huang / Promptable-GRiT

Forked from JialianW/GRiT

Promptable GRiT: support inference with both automatic proposal generation and custom point/box prompts.

Python 4 Updated Nov 28, 2023

zhaoyue-zephyrus / AVION

[arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"

Python 129 10 Updated Jul 31, 2024

artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,461 850 Updated Jun 10, 2024

danieljf24 / awesome-video-text-retrieval

A curated list of deep learning resources for video-text retrieval.

621 67 Updated Oct 20, 2023

microsoft / XPretrain

Multi-modality pre-training

Python 492 37 Updated May 8, 2024

TXH-mercury / VAST

[NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Jupyter Notebook 278 17 Updated Mar 14, 2024

Chuhanxx / helping_hand_for_egocentric_videos

Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'

Python 33 2 Updated Nov 7, 2023

apple / ml-fastvit

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Python 1,920 113 Updated Nov 30, 2023

ArrowLuo / CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Python 949 131 Updated Apr 12, 2024

m-bain / webvid

Large-scale text-video dataset. 10 million captioned short videos.

Python 636 40 Updated Aug 14, 2024

m-bain / frozen-in-time

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]

Python 360 44 Updated May 19, 2022

rese1f / MovieChat

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

Python 621 41 Updated Jan 29, 2025

OpenGVLab / Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,245 264 Updated Jan 18, 2025

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

15,409 997 Updated May 30, 2025

SALT-NLP / LLaVAR

Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"

Python 266 14 Updated Jun 12, 2024

facebookresearch / LaViLa

Code release for "Learning Video Representations from Large Language Models"

Python 520 46 Updated Oct 1, 2023

showlab / EgoVLP

[NeurIPS 2022] Egocentric Video-Language Pretraining

Python 241 20 Updated May 9, 2024
