Stars
A curated list of awesome synthetic data for text location and recognition
A general list of resources for image text localization and recognition: a collection of papers and implementations for scene text localization and recognition
100+ pretrained Chinese word vectors
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Hierarchical Multi-Features Combination Model for Uyghur-Chinese Machine Translation
A curated list of pretrained sentence and word embedding models
Stanford CoNLL 2018 Graph-based Dependency Parser
Uyghur Single Speaker Speech Dataset.
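Chinese and English sensitive words, language detection, location and carrier lookup for Chinese and foreign mobile/landline numbers, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, segmentation of continuous English text, various Chinese word vectors, company name collection, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …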
Collection of fonts for Uyghur Arabic script
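Crawler and data processing scripts for the Wikidata knowledge base; scripts for aligning triple relations to a corpus; scripts for obtaining knowledge graph data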
A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities
TensorFlow code and pre-trained models for BERT
Neural Cross-Lingual Named Entity Recognition with Minimal Resources
A collection of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)
TensorFlow tutorials and best practices.
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
TensorFlow implementation of Relation Classification via Convolutional Deep Neural Network
Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme
Entity Summarization using Ontology-based Topic Models
Deployment (and development) environment for the DURAARK system.
A graphical user interface for the DURAARK platform implemented with EmberJS.