Stars
Real-time transcription with OpenAI Whisper.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Industrial-strength Natural Language Processing (NLP) in Python
A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
Open-source and strong foundation image recognition models.
Build context-aware reasoning applications
Bringing Old Photo Back to Life (CVPR 2020 oral)
Rembg is a tool to remove image backgrounds
Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.
C0untFloyd / bark-gui
Forked from suno-ai/bark. Text-Prompted Generative Audio Model with Gradio
DEPRECATED by https://github.com/mozilla-firefox/firefox. Read-only Git mirror of the Mercurial gecko repositories at https://hg.mozilla.org
OBS Studio - Free and open source software for live streaming and screen recording
Notepad++ official repository
darktable is an open source photography workflow application and raw developer
Read-only mirror of https://gitlab.gnome.org/GNOME/gimp
VLC media player - All pull requests are ignored, please use MRs on https://code.videolan.org/videolan/vlc
Text-Prompted Generative Audio Model
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
Tesseract Open Source OCR Engine (main repository)
Robust Speech Recognition via Large-Scale Weak Supervision
Stable Diffusion web UI
A Gradio web UI for Large Language Models with support for multiple inference backends.
A multi-voice TTS system trained with an emphasis on quality
An easy way to extract information from documents