Stars
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
A batched offline inference oriented version of segment-anything
[CVPR 2023] Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis
paper: https://arxiv.org/abs/2307.02469 page: https://lynx-llm.github.io/
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use. We welcome open-source enthusiasts…
Taming Transformers for High-Resolution Image Synthesis
2023 Mobile Robot Grasping and Navigation Challenge
Code release for "Learning Video Representations from Large Language Models"
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
[ICLR 2024] Exploring Target Representations for Masked Autoencoders
hand-eye calibration, tool-flange calibration
ROS package for calibrating sensors to a known reference frame.
Official implementation of AdaBins: Depth Estimation Using Adaptive Bins
Efficient 3D Backbone Network for Temporal Modeling
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
The toolbox for the Google Refexp dataset proposed in this paper: http://arxiv.org/abs/1511.02283
VisualGPT (CVPR 2022 Proceedings): GPT as a decoder for vision-language models