Starred repositories
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Model Context Protocol Servers
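Curated learning materials for artificial intelligence fields such as machine learning, NLP, image recognition, and deep learning, plus a collection of technical materials on search, recommendation, and advertising system architecture and algorithms. A compilation of notes from algorithm experts.
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.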
Master programming by recreating your favorite technologies from scratch.
Capture Screen, Audio, Cursor, Mouse Clicks and Keystrokes
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
📄 Awesome OCR multiple programming languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.
chineseocr lite android onnx: ultra-lightweight Chinese OCR Android demo, supports vertical text recognition and ONNX inference (psenet+anglenet+crnn)
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
An open-source cross-platform alternative to AirDrop
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Retrieval and Retrieval-augmented LLMs
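Xiaohongshu note and comment crawler, Douyin video and comment crawler, Kuaishou video and comment crawler, Bilibili video and comment crawler, Weibo post and comment crawler, Baidu Tieba post and comment-reply crawler, Zhihu Q&A article and comment crawler
GLM-4 series: Open Multilingual Multimodal Chat LMs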
Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)
Download common web resources such as WeChat Channels, mini programs, Douyin, Kuaishou, Xiaohongshu, live streams, m3u8, Kugou, and QQ Music.
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
50+ mini web projects using HTML, CSS & JS
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
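哔哩下载姬 (downkyi): a video download tool for the Bilibili website that supports batch downloads, 8K, HDR, and Dolby Vision, and provides a toolbox (audio/video extraction, watermark removal, etc.).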