Starred repositories
A Conversational Speech Generation Model
Retargetting motion from one robot to another.
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
🔥Highlighting the top ML papers every week.
Start building LLM-empowered multi-agent applications in an easier way.
ChatGLM-6B HTTP流式解码API的Flask、FastAPI实现,以及开箱即用的Web页面。 a stream decoding demo of ChatGLM-6B using Flask or FastAPI, with web page out-of-the-box.
something like visual-chatgpt, 文心一言的开源版
Interactive code for image similarity using SIFT algorithm
[Skype Silk Codec SDK]Decode silk v3 audio files (like wechat amr, aud files, qq slk files) and convert to other format (like mp3). Batch conversion support.
Kuboard 是基于 Kubernetes 的微服务管理界面。同时提供 Kubernetes 免费中文教程,入门教程,最新版本的 Kubernetes v1.23.4 安装手册,(k8s install) 在线答疑,持续更新。
Tesseract Open Source OCR Engine (main repository)
Alibaba Java Diagnostic Tool Arthas/Alibaba Java诊断利器Arthas
MovieLens based recommender system.使用MovieLens数据集训练的电影推荐系统。
RSI (Relative Strength Index) written in Python
Code samples from the "Python Cookbook, 3rd Edition", published by O'Reilly & Associates, May, 2013.
An Open Source Machine Learning Framework for Everyone