Stars
[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain
[ICML 2024] Official repository of the paper: "Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset"
Machine learning library for educational purposes.
Mathematical modeling and theory for robotics with Python (learning through extensive numerical simulations and animations).
Deep Reinforcement Learning for mobile robot navigation in IR-SIM simulation. Using DRL (SAC, TD3, PPO, DDPG) neural networks, a robot learns to navigate to a random goal point in a simulated envir…
Open Source Text-To-Speech Portuguese Dataset
ComfyUI: 163 nodes. Display, manipulate, and edit text, images, videos, LoRAs and more. Manage looping operations, generate randomized content, use logical conditions and work with external AI to…
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Learn the basics of robotics through hands-on experience using ROS 2 and Gazebo simulation.
Python sample code and textbook for robotics algorithms.
Python sample code and documents about autonomous vehicle control algorithms. This project can be used as a technical guidebook for beginners to study the algorithms and the software architectures.
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…
💦 Seamless, distributed, real-time integration of Blender into PyTorch data pipelines
Godot 4 is an excellent choice for ROS2 robots
PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.
An MIT-licensed implementation of YOLOv9, YOLOv7, YOLO-RD
Simulation and control software for robots
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Downloads videos and playlists from YouTube
DreamDA: Generative Data Augmentation with Diffusion Models (Official Implementation)
Non-Reference IQA implementation in PyTorch CUDA, UCIQE, UIQM, ...
[ACM MM 2024] Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.