Stars
Perform forward and backward citation chasing as part of an evidence synthesis project
A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
JSLint, The JavaScript Code Quality and Coverage Tool
Modeling, training, eval, and inference code for OLMo
Reproducible identifiers & fine-grained build dependency tracking for software artifacts.
A plugin that does one thing only: Detect and manage duplicate items in Zotero.
An open-source Python framework for creating, editing, and invoking Noisy Intermediate-Scale Quantum (NISQ) circuits.
Statistical inference and graphical procedures for RD designs using local polynomial and partitioning regression methods.
R package for Regresssion Design Discontinuity
ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the u…
Toolkit for linearizing PDFs for LLM datasets/training
Code for the EMNLP 2020 paper "Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks"