Mionies (Joanne) / Starred · GitHub
Rust library for indexing and quickly searching large pretraining corpora

Rust 26 4 Updated May 12, 2025
Jupyter Notebook 5 2 Updated Mar 20, 2024

Repository containing code for the paper on identification of source domains by contrastive learning

Python 2 Updated Mar 4, 2024

The official repo for the GlobalBias dataset and associated paper: 'Who is better at math, Jenny or Jingzhen? Exploring Intersectional Biases in Large Language Models'

Jupyter Notebook 3 Updated Dec 30, 2024

Metaphor Dataset

7 3 Updated Jun 6, 2024

Calculate perplexity on a text with pre-trained language models. Supports MLM (e.g. DeBERTa), recurrent LM (e.g. GPT3), and encoder-decoder LM (e.g. Flan-T5).

Python 156 13 Updated Oct 1, 2024

ChatArena (or Chat Arena) is a multi-agent language game environment for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.

Python 1,461 140 Updated May 27, 2024

A Corpus of Potentially Idiomatic Expressions

Python 3 Updated Apr 10, 2019

Evaluating Text Representations on Lexical Composition

Jupyter Notebook 24 5 Updated Oct 30, 2019
Jupyter Notebook 23 1 Updated Jun 23, 2022

A browser extension that alerts you when you navigate to a website belonging to an organization whose employees are on strike.

JavaScript 154 10 Updated Mar 24, 2025

Automatic Idiomatic Expression Detection

Jupyter Notebook 13 1 Updated Sep 26, 2021

CKIP Transformers

Python 729 76 Updated Apr 21, 2023

Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition, EACL 2021"

Python 389 41 Updated May 11, 2023

A BERT-based Chinese Text Encoder Enhanced by N-gram Representations

Python 647 105 Updated Jul 24, 2022

Search all Chinese NLP datasets, with commonly used English NLP datasets included

Python 4,311 624 Updated Nov 21, 2022

A curated list of pretrained sentence and word embedding models

Python 2,254 262 Updated Apr 23, 2021

Cross-lingual metaphor detection.

Python 66 15 Updated Apr 25, 2019

Neural models for documents with metadata

Python 104 18 Updated Apr 14, 2020

Zebra Crossing: an easy-to-use digital safety checklist

445 33 Updated Jan 28, 2025

Super easy library for BERT based NLP models

Python 1,900 342 Updated Aug 19, 2024

Chinese NER using Lattice LSTM. Code for ACL 2018 paper.

Python 20 5 Updated Sep 12, 2018

Chinese NER using the pre-trained language model BERT

Python 955 277 Updated Feb 26, 2020

Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.

Python 4,432 792 Updated Nov 21, 2023

A fast LSTM Language Model for large vocabulary language like Japanese and Chinese

Python 109 23 Updated Jun 4, 2019

Code for the ACL 2018 paper "Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context"

Python 54 8 Updated May 15, 2018

Bayesian Evolutionary Analysis by Sampling Trees

Java 249 84 Updated May 12, 2025

Python Flask & jQuery AJAX sample app

CSS 45 58 Updated Sep 30, 2020