-
Nanjing University
- https://lizhihao6.github.io/
Highlights
- Pro
Lists (11)
Sort Name ascending (A-Z)
Stars
[MM24] Official codes and datasets for ACM MM24 paper "Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models".
Official implementation of Occupancy-Based Dual Contouring (SIGGRAPH Asia 2024).
TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes
WebUI extension for ControlNet
Training-Free Text-Guided Image Editing Using Visual Autoregressive Model
[CVPR 2025] Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering
[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into offical pipelines of diffusers.)
[CVPR 2025] Code for Deformable Radial Kernel Splatting
SpatialLM: Training Large Language Models for Structured Indoor Modeling
[SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal
Wan: Open and Advanced Large-Scale Video Generative Models
Builder and index for PyTorch packages
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Alternative to get fivek expert images without Lightroom.
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…
[SGP 2025] OctFusion: Octree-based Diffusion Models for 3D Shape Generation
Don't Starve Together server panel. Manage room with ease, featuring visual world and mod management, player log collection。饥荒联机服务器面板。轻松管理房间,支持可视化的世界和模组管理,玩家日志采集
VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey