Stars
Open NLU and NLG datasets created within the Latvian Language Technology Initiative
Data for the HIPE 2022 shared task.
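Practical assignment materials for the course "Valodu tehnoloģiju pamati" (Fundamentals of Language Technologies, DatZB022), part of the "Valodu tehnoloģijas" (Language Technologies) module of the bachelor's study programme at the University of Latvia Faculty of Computing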
Software for humanities scholars using quantitative or computational methods.
This repo includes all the projects I have finished in the Udacity Nanodegree programs
All projects and lecture notes of the Udacity Machine Learning Engineer Nanodegree.
This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face discord: hf.co/join/discord
Flask-based web framework for visualisation and explorative listening of audio.
Linked Places format is used to describe attestations of places in a standard way, primarily for linking gazetteer datasets.
Curated corpus of parallel data derived from versions of the Bible provided by eBible.org.
OpenOrienteering Mapper is software for creating maps for the sport of orienteering.
Interactive Visualization Interface for Multidimensional Datasets
Jupyter notebooks (with answers) used during the Deep Learning MOOC
Word-list-based post-OCR correction, originally designed for historical medical text
OCR based on pytesseract
This repo works as a sandbox environment for htrflow.
Gazetteers for Swedish first names, surnames, organisations and different locations
All the material (paper, code, dataset, results) of our DAS 2022 paper (OCR+NER benchmark)
A repository for mathematics, machine learning and deep learning formula sheets created by Fady Morris
Free MLOps course from DataTalks.Club
Editor for aligned parallel texts (personal desktop application).
This dataset contains naturally-occurring English sentences that feature non-trivial noun-verb ambiguity.
Code and associated files for the AI Programming with Python Nanodegree Program