Stars
Working repo for testing the effects of code training on entity tracking.
course website for Experimental and Computational Methods for Linguistic Research
tools for generating experimental materials and packaging psycholinguistic experiments
A package for handy processing of semantic graphs such as AMR, with a special focus on standardized evaluation
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
Code to compute a word's probability using the fixes from "How to Compute the Probability of a Word"
Tree edit distance using the Zhang Shasha algorithm
Constraction is an open-source tool for the automatic extraction and interactive exploration of linguistic constructions
Stanford NLP Python library for understanding and improving PyTorch models via interventions
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Python API for loading language data from American-English CHILDES database
ngram language modeling and naive bayes classification
Highlight errors in a bib file: missing URLs, capitalization protection, etc
[Kauf & Ivanova, ACL 2023] A Better Way to Do Masked Language Model Scoring
Predict psycholoinguistic feature norms for words in context.
Inspecting and Editing Knowledge Representations in Language Models
Python implementation of semantic projection from Grand et al. (2022)
Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant sentences in problem descriptions. GSM-IC is constructed to ev…
The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.
An awesome & curated list for Artificial General Intelligence, an emerging inter-discipline field that combines artificial intelligence and computational cognitive sciences.
Scaffold for NLP researcher to quickly set up the codebase