arctanbell / Starred · GitHub

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

2,443 112 Updated Jun 20, 2025
Python 6,530 444 Updated May 21, 2025

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 6,994 1,263 Updated Nov 26, 2024

DilatedToothSegNet: Tooth Segmentation Network on 3D Dental Meshes Through Increasing Receptive Vision

Python 67 13 Updated Jun 10, 2024
Python 4,368 354 Updated Jun 12, 2025

[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Python 1,064 42 Updated Oct 9, 2024

WebRTC/RTSP/RTMP/HTTP/HLS/HTTP-FLV/WebSocket-FLV/HTTP-TS/HTTP-fMP4/WebSocket-TS/WebSocket-fMP4/GB28181/SRT server and client framework based on C++11

C++ 15,304 3,699 Updated Jun 24, 2025

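WEB VIDEO PLATFORM is a network video platform built on the GB28181-2016 standard. It supports NAT traversal and access for IPC, NVR, and DVR devices from brands such as Hikvision, Dahua, and Uniview. It also supports GB28181 cascading, forwarding rtsp/rtmp and other video streams to a GB28181 platform, and forwarding rtsp/rtmp and other pushed streams to a GB28181 platform.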

Java 5,985 1,707 Updated Jun 26, 2025

Workflow sharing from a fun programmer born in the 1980s

843 186 Updated Jun 25, 2025

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 7,608 583 Updated Jul 17, 2024

Open-source and strong foundation image recognition models.

Jupyter Notebook 3,296 307 Updated Feb 18, 2025

End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)

Python 219 23 Updated Jan 3, 2024

Multi-model video-to-text by combining embeddings from Flan-T5 + CLIP + Whisper + SceneGraph. The 'backbone LLM' is pre-trained from scratch on YouTube (YT-1B dataset).

Jupyter Notebook 52 8 Updated Apr 21, 2023

Scenic: A Jax Library for Computer Vision Research and Beyond

Python 3,577 455 Updated Jun 23, 2025
Python 4 Updated Feb 17, 2023

LLM UI with advanced features, easy setup, and multiple backend support.

Python 44,095 5,682 Updated Jun 25, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,681 1,044 Updated Nov 18, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,443 2,639 Updated Jun 3, 2025

Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.

Python 1 Updated Apr 22, 2023

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Python 10,588 2,095 Updated Nov 3, 2023

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 34,563 4,939 Updated Jun 26, 2025

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 176,418 45,834 Updated Jun 26, 2025

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

Python 18,863 1,890 Updated Apr 30, 2024

Official code for BEVDepth.

Python 787 110 Updated Jan 18, 2023

[IEEE T-PAMI 2023] Awesome BEV perception research and cookbook for audiences of all levels in autonomous driving

Python 1,282 112 Updated Jan 6, 2024

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Python 3,824 613 Updated Aug 15, 2024

Object detection and instance segmentation toolkit based on PaddlePaddle.

Python 3 3 Updated Sep 18, 2024

The repository containing tools and information about the WoodScape dataset.

Python 651 130 Updated Aug 26, 2023

Convolutional Neural Networks

C 26,230 21,305 Updated May 3, 2024