Stars
The simplest, fastest repository for training/finetuning medium-sized GPTs.
The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…
Keras implementation of transformers for humans
Tesseract Open Source OCR Engine (main repository)
scikit-learn, NLTK, spaCy, Gensim, TextBlob, and more
pycorrector is a toolkit for text error correction. It applies models such as KenLM, T5, MacBERT, ChatGLM3, and Qwen2.5 to error-correction scenarios and works out of the box.
The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.
A flexible, high-performance serving system for machine learning models
Kashgari is a production-level NLP transfer learning framework built on top of tf.keras for text labeling and text classification; it includes Word2Vec, BERT, and GPT2 language embeddings.
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm series models)
This experiment uses BERT for Chinese sentiment classification and documents the detailed steps and the complete program.
YSDA course in Natural Language Processing
A system for quickly generating training data with weak supervision
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
Large Scale Chinese Corpus for NLP
TensorFlow code and pre-trained models for BERT
Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT
Bug-tracking for Jeff's algorithms book, notes, etc.
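500 Questions on Deep Learning: explains frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas in a question-and-answer format, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Because the author's expertise is limited, readers are kindly asked to point out any shortcomings. To be continued... For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be prosecuted. Tan 2018.06
MATLAB code for the machine learning algorithms in the book PRML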
nwth / chinese-xinhua
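Forked from pwxcoo/chinese-xinhua
Chinese Xinhua Dictionary database, including xiehouyu (two-part allegorical sayings), idioms, words, and Chinese characters. Provides a Xinhua Dictionary API.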