Starred repositories
[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
AI's query engine - Platform for building AI that 8000 can answer questions over large scale federated data. - The only MCP Server you'll ever need
Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".
🌐 The Internet OS! Free, Open-Source, and Self-Hostable.
硬核书<<程序原理>>,有难度,有深度,值得拥有! 使用汇编、C、Java全面系统讲解,理论结合实践,图表并茂。
Repo to accompany my mastering LLM engineering course
TypeScript-first schema validation with static type inference
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
The official repository of Mozilla's Firefox web browser.
This repository collects research papers of large Vision Language Models in Autonomous driving and Intelligent Transportation System. The repository will be continuously updated to track the lates…
A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.
A functionally complete decompilation of LEGO Island (1997)
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!
Minimal example utilizing fastapi and celery with RabbitMQ for task queue, Redis for celery backend and flower for monitoring the celery tasks.
Fully Python async FastAPI project! 🚀
Ready-to-use and customizable users management for FastAPI
Have a natural, spoken conversation with AI!
An ebook reader application supporting PDF, DjVu, EPUB, FB2 and many more formats, running on Cervantes, Kindle, Kobo, PocketBook and Android devices
A self-hosted dashboard that puts all your feeds in one place
An open source payments switch written in Rust to make payments fast, reliable and affordable