-
SJTU
- Shanghai
- https://bopang1996.github.io/
Stars
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
A generative world for general-purpose robotics & embodied AI learning.
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
A Clash GUI based on tauri. Supports Windows, macOS and Linux.
A temporary webpage for our survey in AGI for computer vision
LAVIS - A One-stop Library for Language-Vision Intelligence
Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
A Java implemented Texas holdem and short deck Solver
PyTorch code and models for the DINOv2 self-supervised learning method.
A classified list of meta learning papers based on realm.
Au AC90 toGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Official code release of "CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition"
Replacement for Clubman+ AHK script, written from scratch in C#
Awesome Knowledge Distillation
Pytorch implementation of various Knowledge Distillation (KD) methods.
Official PyTorch implementation of "Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data"
[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
This is an official implementation for "Video Swin Transformers".
XuyangBai / TransFusion
Forked from open-mmlab/mmdetection3d[PyTorch] Official implementation of CVPR2022 paper "TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers". https://arxiv.org/abs/2203.11496
[AAAI 2021] Official Implementation of "SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-Training for Spatial-Aware Visual Representations"
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
[CVPR22] Official codebase of Semantic Segmentation by Early Region Proxy.
Official codes: Self-Supervised Learning by Estimating Twin Class Distribution
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"