Stars
Official repo of Griffon series including v1(ECCV 2024), v2, and G
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
A Unified Toolkit for Deep Learning Based Document Image Analysis
OCR, layout analysis, reading order, table recognition in 90+ languages
YOLOv11 trained on DocLayNet dataset.
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’
A Notebook with Flexible Customization and Easy Integration.
OpenUI let's you describe UI using your imagination, then see it rendered live.
PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
Turn any webpage into structured data using LLMs
ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…
@material-tailwind is an easy-to-use components library for Tailwind CSS and Material Design.
daily.dev is a professional network for developers to learn, collaborate, and grow together 👩🏽💻 👨💻
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and te…
The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Basic Discord app with examples
An LLM-powered autonomous agent platform
Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
This project converts the API of Anthropic's Claude model to the OpenAI Chat API format.
A list of Free Software network services and web applications which can be hosted on your own servers
💥 A flexible rendering engine for visualization.
🚀 JavaScript diagramming library that uses SVG and HTML for rendering.
🌎 Large-scale WebGL-powered Geospatial Data Visualization analysis engine.
🌍 localized message files generating automatic solution
Video.js - open source HTML5 video player