AI
Stable Diffusion web UI
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Implementation of CVPR'23: Learning 3D Scene Priors with 2D Supervision
A procedural Blender pipeline for photorealistic training image generation
open Multiple View Geometry library. Basis for 3D computer vision and Structure from Motion.
收集分享 AI 大型语言模型 (LLM)、AI 辅助编程、AI 绘画等领域的常用资料,探索生成式人工智能的应用与开发。
Support BlenderProc2 with multi-GPU batch rendering and 3D visualization for 3D-Front
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
A shift-window based transformer for 3D sparse tasks
Official implementation of the NeurIPS 2021 paper "Panoptic 3D Scene Reconstruction from a Single RGB Image"
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
COLMAP - Structure-from-Motion and Multi-View Stereo
[NeurIPS 2023] The repo of CommonScenes, a scene generation method powered by the diffusion model.
CVPR2024 | LASA: Instance Reconstruction from Real Scans using A Large-scale Aligned Shape Annotation Dataset
[NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control
This repo accompanies the research paper, ARKitScenes - A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data and contains the data, scripts to visualize and proces…
[CVPR2019 Oral] Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation on Python3, Tensorflow, and Keras
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
[ CVPR 2025 ] We introduce LT3SD, a novel latent 3D scene diffusion approach enabling high-fidelity generation of infinite 3D environments in a patch-by-patch and coarse-to-fine fashion.
Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024
AI wearables. Put it on, speak, transcribe, automatically
🌮 Trash Annotations in Context Dataset Toolkit