-
CSIRO
- Canberra, Australia
- https://researchers.anu.edu.au/researchers/li-xxxxx
- @Benzlee4
Highlights
- Pro
Starred repositories
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Demo of a customer service use case implemented with the OpenAI Agents SDK
Janus-Series: Unified Multimodal Understanding and Generation Models
Official Code Release for [SIGGRAPH 2025] RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination
[CVPR'25] DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
PC-NBV: A Point Cloud Based Deep Network for Efficient Next Best View Planning, IROS, 2020
My learning notes/codes for ML SYS.
[CVPR 2025] AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis
[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
[CVPR 2025 Highlight] Real-time dense scene reconstruction with SLAM3R
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
The simplest, fastest repository for training/finetuning small-sized VLMs.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Codebase of our paper in CVPR 2025: "Neural Hierarchial Decomposition for Single Image Plant Modeling."
Minimalistic 4D-parallelism distributed training framework for education purpose
[CVPR 2025] Code for Deformable Radial Kernel Splatting
Lightweight coding agent that runs in your terminal
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
This is the official implementation for WACV 2024 paper "Label Shift Estimation for Class-Imbalance Problem: A Bayesian Approach".
A collection of production-ready Generative AI Agent templates built for Google Cloud. It accelerates development by providing a holistic, production-ready solution, addressing common challenges (D…
Archon is an AI agent that is able to create other AI agents using an advanced agentic coding workflow and framework knowledge base to unlock a new frontier of automated agents.
[Lumina Embodied AI Community] 具身智能技术指南 Embodied-AI-Guide
[ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction
Open source impl of **MV-DUSt3R+ Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds** from Meta Reality Labs. Project page https://mv-dust3rp.github.io/
A simple training-free approach adapting DUSt3R for dynamic scenes.