Starred repositories
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
🔥🔥 btrace (AKA RheaTrace) is a high-performance Android trace tool based on Perfetto. It supports defining custom events automatically while building the APK and uses bhook to provide more n…
We write your reusable computer vision tools. 💜
Multiple samples showing the best practices in media APIs on Android (audio, video, etc.).
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
GPT-4V in Wonderland: LMMs as Smartphone Agents
MobileLLM / DroidBot-GPT
Forked from honeynet/droidbot
Automating Android apps with ChatGPT-like LLM.
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
A modern, fast and productivity-driven SQL client with a focus on UX
Database manager for MySQL, PostgreSQL, SQL Server, MongoDB, SQLite and others. Runs under Windows, Linux, Mac or as web application
Official home of the DB Browser for SQLite (DB4S) project. Previously known as "SQLite Database Browser" and "Database Browser for SQLite". Website at:
A free, open source, multi-platform SQLite database manager.
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Generative Models by Stability AI
All kinds of text classification models and more with deep learning
Bolt is a deep learning library with high performance and heterogeneous flexibility.
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
High-Resolution Image Synthesis with Latent Diffusion Models
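🎒 Feishu × (GPT-4 + GPT-4V + DALL·E-3 + Whisper) = a work experience that flies 🚀 Voice conversations, role-play, multi-topic discussions, image creation, table analysis, document export 🚀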
Play with ChatGPT and other LLMs on the Xiaomi AI Speaker
BibiGPT v1 · One-click AI Summary for Audio/Video & Chat with Learning Content: Bilibili | YouTube | Tweet | TikTok | Dropbox | Google Drive | Local files | Websites | Podcasts | Meetings | Lectures, etc. 音视…