Stars
Master the command line, in one page
This is a sample code for AutoSimulTrans Workshop (https://autosimtrans.github.io)
Template Makefile for ML projects in Python.
The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the dataset includes a large collection of native script Wikipedia tex…
PyTorch Implementation and Explanation of Graph Representation Learning papers: DeepWalk, GCN, GraphSAGE, ChebNet & GAT.
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Google AI 2018 BERT pytorch implementation
Transformer seq2seq model, program that can build a language translator from parallel corpus
Extremely simple and fast word2vec implementation with Negative Sampling + Sub-sampling
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
Minimum description length principle algorithm in Python, for optimal binning of continuous variables
REST api to view and send daily quote as SMS from goodreads.com
Master programming by recreating your favorite technologies from scratch.
generative adversarial nets for neural machine translation
Generative Adversarial Networks in Neural Machine Translation
scripts and configuration files for Edinburgh neural MT submission to WMT 16 shared translation task
Differentiable Optimization-Based Modeling for Machine Learning
Python library for converting UTF to WX and vice-versa for Indian languages.
Train bilingual embeddings as described in our NAACL 2015 workshop paper "Bilingual Word Representations with Monolingual Quality in Mind". Besides, it has all the functionalities of word2vec with …
InferSent sentence embeddings
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/