Stars
Let your Claude be able to think
Fast inference from large language models via speculative decoding
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
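Doing simple retrieval from LLMs at various context lengths to measure accuracy
A fairly comprehensive collection of sensitive words, subdivided into a violence and terrorism lexicon, reactionary lexicon, livelihood lexicon, pornography 8000 lexicon, corruption lexicon, other lexicons, etc.
Awesome-LLM: a curated list of Large Language Models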
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Large-scale Pre-training Corpus for Chinese (100 GB)
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…
Open source annotation tool for machine learning practitioners.
(Linear-chain) Conditional random field in PyTorch.
Implementation of a linear-chain CRF in PyTorch
A PyTorch-based knowledge distillation toolkit for natural language processing
Chinese Pre-Trained Language Models (CPM-LM) Version-I
Implementation of a linear CRF for Chinese word segmentation
pkuseg: a toolkit for multi-domain Chinese word segmentation
[ICML 2020] Continuously Indexed Domain Adaptation
Algorithm practice is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI