-
Universitat de Barcelona
- Gran Via de Les Corts Catalanes, 585, 08007 Barcelona
- @nospotfer
Starred repositories
CLI tool for quickly finding and using terminal commands.
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
Python tool for converting files and office documents to Markdown.
License Plate Detection and Text Extraction with YoloV8 and EasyOCR
Get your documents ready for gen AI
2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.
Convert Compute And Books Into Instruct-Tuning Datasets! Makes: QA, RP, Classifiers.
This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadowing, dewarping, deblurring, binarization and so on.
RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF
OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR
An innovative library for efficient LLM inference via low-bit quantization
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.
A collection of digital forensics tools for verification, investigations, diagnostics, software, libraries, learning tutorials, frameworks, academic and practical resources in Cybersecurity
The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.
Examples and guides for using the OpenAI API
Interact with your documents using the power of GPT, 100% privately, no data leaks
Model for document segmentation trained on the midv-500-models dataset.
Results of experiments for Advanced Hough-based method for on-device document localization
Easy to download and parse version of the Smartdoc 2015 - Challenge 1 dataset.
Refine high-quality datasets and visual AI models
A vertical mill anomaly detection using Isolation Forests
python library for invisible image watermark (blind image watermark)
A tool for refurbishing and modernizing Python codebases