Stars
This repository contains a collection of resources and papers on Diffusion Models for Robotic Manipulation.
Image and video tokenizer/VAE selection guide, with text and face reconstruction evaluation.
SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL
By converting single-channel grayscale images into multi-channel images through various data enhancement techniques, SimOTM enhances the detection capabilities of object detection models without co…
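A utility class for the Android platform that quickly extracts image data from a Surface. The Surface data source can be MediaCodec, Camera, VirtualDisplay, and so on. SurfaceBridge supports multiple output formats, such as RGB, YUV420, and YUV444. You can also use a SurfaceView or TextureView directly as the output. To accommodate different output sizes and aspect ratios, SurfaceBri…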
[RA-L 2025 Accept without Revision] A stereo visual-inertial odometry system based on a voxel map
Spatial navigation. Arrow key navigation.
Matrix-Game: Interactive World Foundation Model
Train your Agent model via our easy and efficient framework
AudioTrust: Benchmarking the Multi-faceted Trustworthiness of Audio Large Language Models
Official implementation of RARE: Retrieval-Augmented Reasoning Modeling
HiGoalVita is a modular, layered, production-ready AI RAG suite.
Memory-Guided Diffusion for Expressive Talking Video Generation
Official repo for paper "Large Language Models can be Guided to Evade AI-Generated Text Detection" in TMLR 2024.
Simple web UI to manage MCP (Model Context Protocol) servers in the Claude app
[ICML 2025] Official repository for paper "Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation"
AI-powered tool for efficient abstract and PDF screening in systematic reviews.
A Python library for Gene–environment interaction analysis via deep learning
Create production-ready, full-suite agents that offer RAG (Retrieval-Augmented Generation), function calling, a code interpreter, and streaming capabilities, all with a great UI!
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer
Fast, stateless gateway with HMAC-based token auth, request-level tracing, and vector-ready logs.
AI Manus is a general-purpose AI Agent system that supports running various tools and operations in a sandbox environment.