Stars
A simple framework for Android Bluetooth Low Energy (BLE)
Backend service for xiaozhi-esp32 that helps you quickly set up an ESP32 device control server.
Multilingual Voice Understanding Model
Instant voice cloning by MIT and MyShell. Audio foundation model.
Learning paths and knowledge summaries for machine learning and deep learning
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…
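💯 2025 exam-preparation resource library for the System Planning and Management Engineer certification (软考, advanced level). Free PC practice-question software: https://ruankaodaren.com
A tool for scanning and speed-testing the IP addresses of Google Translate servers in mainland China.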
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
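Deep learning interview handbook (covering mathematics, machine learning, deep learning, computer vision, natural language processing, SLAM, and related areas)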
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Offline speech recognition for Android with Vosk library.
Provides read/write debugging tools between USB serial ports and serial ports (UART, RS232) on Android
[🔥updating ...] AI automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
Code for fine-tuning ChatGLM-6B using low-rank adaptation (LoRA)
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
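A collection of introductory hardware and software resources on K210 development with MaixPy, the SDK, and IDEs, to help beginners quickly understand, learn, and use the K210
A stream decoding demo of ChatGLM-6B: Flask and FastAPI implementations of an HTTP streaming-decoding API, with an out-of-the-box web page.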
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.