Lists (5)
Sort Name ascending (A-Z)
Stars
[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.
2019CCF-BDCI大赛 OCR赛题第一名 天晨破晓团队 去水印网络CGAN模型baseline
DocBank 文档图像增强数据集,此数据集用于文档图像增强,具体任务包括以下内容:Seal detection & Removal 印章检测 & 移除 ;Watermark detection & Removal 水印检测 & 移除;Document deblurring 文档去模糊;Document shadow removal 文档去阴影;Document super-resoluti…
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
📄 Configuration files that enhance Cursor AI editor experience with custom rules and behaviors
Model Context Protocol Servers
A curated list of top best AI Related Newsletters and ai agents newsletters
Use LLMs to dig out what you care about from massive amounts of information and a variety of sources daily.
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
LangGPT: Empowering everyone to become a prompt expert!🚀 Structured Prompt,Language of GPT, 结构化提示词,结构化Prompt, Created by 「云中江树」
Repository for the paper "Large Language Model-Based Agents for Software Engineering: A Survey". Keep updating.
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
This repository contains demos I made with the Transformers library by HuggingFace.
IntelliJ SDK Platform Documentation
Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools,…
A JetBrains series plugin to help git code submission specifications, support IDEA, WebStorm, AndroidStudio, PyCharm, CLoin, GoLand, PhpStorm ... https://plugins.jetbrains.com/plugin/13477-git-comm…
Example code for the book Fluent Python, 1st Edition (O'Reilly, 2015)
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻中国独立开发者项目列表 -- 分享大家都在做什么
A minimal GPU design in Verilog to learn how GPUs work from the ground up
AirLLM 70B inference with single 4GB GPU
2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.
A curated list of resources for Document Understanding (DU) topic
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.