For this repo to work:
- You need Python 3.7 (strictly).
- Create a venv:
  - Install the latest torch.
  - Install the packages in transformers/requirements.txt.
  - Install transformers as an editable package.
- Create a second venv:
  - Install the latest torch, transformers, and datasets, and download the models using converter.py.
- Run the models; make sure you check the model paths (see the sketch after this list).
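A minimal command-level sketch of the two-venv workflow above. The venv names (venv-train, venv-convert) are placeholders, the path to the bundled transformers copy is assumed, and converter.py's arguments (if any) depend on the script itself; adjust paths to your setup.

# Venv 1: fine-tuning with the bundled (modified) transformers
python3.7 -m venv venv-train
source venv-train/bin/activate
pip install torch
pip install -r transformers/requirements.txt
pip install -e ./transformers
deactivate

# Venv 2: converting/downloading models
python3.7 -m venv venv-convert
source venv-convert/bin/activate
pip install torch transformers datasets
python converter.py
deactivate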
This repository contains the code for the EMNLP 2020 paper "An Analysis of Natural Language Inference Benchmarks through the Lens of Negation". Paper link: https://www.aclweb.org/anthology/2020.emnlp-main.732.pdf
Authors: Md Mosharaf Hossain, Venelin Kovatchev, Pranoy Dutta, Tiffany Kao, Elizabeth Wei and Eduardo Blanco
Download RTE, SNLI, and MNLI using the "download_glue_data.py" script from https://github.com/nyu-mll/GLUE-baselines:
python ./data/download_glue_data.py --data_dir ./data/GLUE --tasks RTE,SNLI,MNLI
Python 3.6+ (recommended: Python 3.7)
Python packages: the list of packages is provided in the ./env-setup/requirements.txt file.
(We used an older version of the Hugging Face transformers package, version 2.1.1.)
# Create a virtual env (assuming you have Python 3.7 installed on your machine) -> optional step
python3 -m venv your_location/negation-and-nli
source your_location/negation-and-nli/bin/activate
# Install required packages -> required step
pip install -r ./env-setup/requirements.txt
At the very beginning, the directories below need to be created. The predicted labels are saved in the "outputs/predictions" directory and the fine-tuned models in the "outputs/models" directory.
mkdir outputs
mkdir outputs/predictions
mkdir outputs/models
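Equivalently, in a single command (mkdir -p also avoids errors if the directories already exist):
mkdir -p outputs/predictions outputs/models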
Fine-tune the transformers using RTE training split and evaluate on the RTE dev split:
sh rte-train.sh
Fine-tune the transformers using SNLI training split and evaluate on the SNLI dev split:
sh snli-train.sh
Fine-tune the transformers using MNLI training split and evaluate on the MNLI dev split:
sh mnli-train.sh
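The rte-train.sh, snli-train.sh, and mnli-train.sh scripts are not reproduced here. As a rough illustration, a fine-tuning run with the examples/run_glue.py script shipped with transformers 2.1.1 looks like the sketch below; the model, hyperparameters, script path, and output directory are illustrative assumptions, not the exact settings used for the paper.

python ./transformers/examples/run_glue.py \
  --model_type bert \
  --model_name_or_path bert-base-uncased \
  --task_name RTE \
  --do_train \
  --do_eval \
  --do_lower_case \
  --data_dir ./data/GLUE/RTE \
  --max_seq_length 128 \
  --per_gpu_train_batch_size 16 \
  --learning_rate 2e-5 \
  --num_train_epochs 3.0 \
  --output_dir ./outputs/models/rte-bert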
Evaluate on the new RTE benchmark:
sh rte-evaluate.sh
Evaluate on the new SNLI benchmark:
sh snli-evaluate.sh
Evaluate on the new MNLI benchmark:
sh mnli-evaluate.sh
Results (Table 7 of our paper) can be reproduced with the script below:
python evaluate.py --corpus corpus_name
- Arguments: --corpus: name of the corpus (rte, snli, or mnli); see the example below.
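For example, to compute the results on the new RTE benchmark:
python evaluate.py --corpus rte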
The annotation files of the new NLI benchmarks containing negation are given below.
RTE: ./data/new_benchmarks/clean_data/RTE.txt
SNLI: ./data/new_benchmarks/clean_data/SNLI.txt
MNLI: ./data/new_benchmarks/clean_data/MNLI.txt
@inproceedings{hossain-etal-2020-analysis,
title = "An Analysis of Natural Language Inference Benchmarks through the Lens of Negation",
author = "Hossain, Md Mosharaf and
Kovatchev, Venelin and
Dutta, Pranoy and
Kao, Tiffany and
Wei, Elizabeth and
Blanco, Eduardo",
booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
month = nov,
year = "2020",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/2020.emnlp-main.732",
pages = "9106--9118",
}