Lists (2)
Sort Name ascending (A-Z)
Starred repositories
Master programming by recreating your favorite technologies from scratch.
Greek open source Morphological dictionary and application of it to Greek spelling tools
A collection of tools for better IDE experience.
A curated list of awesome places to learn and/or practice algorithms.
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
A Fast and Accurate Neural Thai Word Segmenter
Thai natural language processing in Python
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Kyrgyz language processing software, models and datasets.
📝 A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion
ASCII transliterations of Unicode text - GitHub mirror
Unicode to ASCII transliteration - C Elixir Go Java JS Julia PHP Python Ruby Rust Shell .NET
A ~150K-token named entity corpus of Modern Eastern Armenian
Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.
Training scripts and instructions how to reproduce our systems submitted to the NEWS 2018 Task on Transliteration of Named Entities: R. Grundkiewicz, K. Heafield: Neural Machine Translation Techniq…
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
Language-Agnostic SEntence Representations
A library for Multilingual Unsupervised or Supervised word Embeddings
a JavsScript package for transliterating Greek
Cleaned, parsed, and analyzed Hebrew textual corpus of documents pertaining to construction, planning, and architecture.
Persian raw text - حدود ۸۰ گیگابایت متن خام فارسی
A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard