RNN-Test

RNN-Test: Towards Adversarial Testing for Recurrent Neural Network Systems

This repository provides the source code of RNN-Test, example adversarial inputs for three models, and figures not included in the paper, mainly for reproducibility. The code base is still rough and will be maintained continuously.

Description

RNN-Test is an adversarial testing framework for recurrent neural network (RNN) systems that can be applied to variants of RNN models. Here we evaluate it on three RNN models: the PTB language model, a spell checker model and the DeepSpeech ASR model. The PTB language model is a well-known language model that generates subsequent text given the preceding text as input. The spell checker model is a widely used seq2seq model for NLP tasks; it receives a sentence with spelling mistakes as input and outputs the sentence with the mistakes corrected. The DeepSpeech ASR model is a state-of-the-art speech-to-text RNN model that is employed in many security-critical scenarios.

Repository structure

This repository consists of three main parts: src, adv outputs and figures. The file structure is listed below.

|-- RNN-Test
    |-- README.md 
    |-- src                                                       //Source code, including the model to test
    |   |-- audio                                                 //code for testing DeepSpeech ASR model
    |   |   |-- statefulRNNCell.py                                //RNN wrapper, to retrieve states
    |   |   |-- gen_adv_exp.py                                    //main procedure to generate adversarial inputs, able to output experiment data to .csv file
    |   |   |-- coverage.py                                       //boosting and computing the coverage metrics
    |   |   |-- make_checkpoint.py                                //utilities for the model
    |   |   |-- filterbanks.npy                                   //utilities for the model
    |   |   |-- tf_logits.py                                      //utilities for the model
    |   |   |-- testdata                                          //original test inputs, of .wav format
    |   |   |   |-- sample-000000.wav                             //one of the inputs
    |   |   |   |-- ...                                           //other inputs
    |   |   |-- commonvoice
    |   |       |-- labels.csv                                    //the predicted label 
    |   |       |-- ...
    |   |-- ptb                                                   //code for testing PTB language model
    |   |   |-- ptb_word_lm_wrapped.py                            //code of PTB language model, wrapped with RNN wrapper
    |   |   |-- gen_adv_exp.py                                    //same function as that of audio
    |   |   |-- coverage.py                                       //same function as that of audio
    |   |   |-- myreader.py                                       //utility to read data for the model
    |   |   |-- ckpt                                              //binary model to test
    |   |   |   |-- ... 
    |   |   |-- testdata                                          //training, valid, test data for the model
    |   |       |-- ptb.test.txt
    |   |       |-- ptb.train.txt
    |   |       |-- ptb.valid.txt
    |   |-- spell checker                                         //code for testing spell checker model
    |        |-- statefulRNNCell.py                               //RNN wrapper, to retrieve states
    |        |-- SpellChecker_wrapped.py                          //code of spell checker model, wrapped with RNN wrapper
    |        |-- gen_adv_exp.py                                   //same as above
    |        |-- coverage_sp.py                                   //same as above
    |        |-- kp=0.85,nl=2,th=0.95,11.ckpt.*                   //binary model to test
    |        |-- test_seq_short                                   //test inputs, with labels
    |        |-- books                                            //resources for training the model
    |            |-- ...
    |-- adv outputs                                               //example experimental outputs, including adversarial inputs 
    |   |-- audio
    |   |   |-- 2_state_diff_cell_state_adv_1572945539            //adversarial inputs of RNN-Test w. CS_C
    |   |       |-- adv_sample-000000.wav                         //each adversarial input
    |   |       |-- ...
    |   |-- ptb
    |   |   |-- 2_state_diff_hidden_state_adv1                    //output of its gen_adv_exp.py, including the adversarial inputs obtained
    |   |-- spell checker
    |       |-- 2_state_diff_cell_state_adv1                      //same as above
    |-- figures                                                   //figures not listed in the paper
        |-- RQ2                                                   //figures for RQ2 in the paper
        |   |-- perturbation                                      //perturbation vectors and the figures
        |       |-- audio                                         
        |       |   |-- sample-000000.eps                         //perturbation figure of sample-000000 input
        |       |   |-- sample-000000_vector.txt                  //perturbation vector of sample-000000 input before TSNE transformation
        |       |   |-- ...                                       //similar figures and vectors of other inputs
        |       |-- ptb                                           
        |       |   |-- step_0.eps                                //perturbation figure of input of step 0
        |       |   |-- step_0_vector.txt                         //perturbation vector of input of step 0 before TSNE transformation
        |       |   |-- ... 
        |       |-- sp                                            
        |           |-- step_0.eps                                //similar as above
        |           |-- step_0_vector.txt                         
        |           |-- ...
        |-- RQ3                                                   //figures for RQ3 in the paper
            |-- correlation                                       //correlation between model performance metrics and coverage value
            |   |-- audio                                         
            |   |   |-- DXSuccess rate.eps                        //success rate w.r.t value of neuron coverage
            |   |   |-- DXWER.eps                                 //WER w.r.t value of neuron coverage
            |   |   |-- hiddenSuccess rate.eps                    //success rate w.r.t value of hidden state coverage
            |   |   |-- hiddenWER.eps                             //WER w.r.t value of hidden state coverage
            |   |   |-- cell[0]Success rate.eps                   //success rate w.r.t value of cell state coverage on section 1
            |   |   |-- cell[0]WER.eps                            //WER w.r.t value of cell state coverage on section 1
            |   |   |-- ...                                       //other performance metrics w.r.t cell state coverage on other sections
            |   |-- ptb                                           
            |   |   |-- DX_pp.eps                                 //perplexity w.r.t value of neuron coverage
            |   |   |-- hidden_pp.eps                             //similar as above
            |   |   |-- cell[0]pp.eps                             //similar as above
            |   |   |-- ...
            |   |-- sp                                            //similar as above
            |       |-- ...                                       
            |-- distribution                                      //distribution of value ranges of coverage metrics
                |-- audio                                         
                |   |-- DX_box_toge.eps                           //value range of neuron coverage
                |   |-- hidden_box_toge.eps                       //value range of hidden state coverage
                |   |-- cell[0]_box_toge.eps                      //value range of cell state coverage on section 1
                |   |-- ...                                       
                |-- ptb                                           //similar as above
                |   |-- ...
                |-- sp                                            //similar as above
                    |-- ...

Dependencies

These libraries can be installed with pip or conda.

Python 3.5/3.6
tensorflow 1.8
DeepSpeech 0.1.1 (for DeepSpeech ASR model)
python_speech_features (for DeepSpeech ASR model)
numpy
matplotlib
scipy
pandas
nltk
Levenshtein
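
Before running the scripts, it can help to confirm that the listed libraries are importable in the active environment. The snippet below is a minimal sketch, not part of the repository: the helper name missing_modules is hypothetical, and the module names are assumed to match the import names of the packages listed above. DeepSpeech is left out because it is obtained from the cloned repository in the steps further down, and version numbers are not checked.

```python
import importlib.util

# Import names assumed to correspond to the dependencies listed above.
MODULES = [
    "tensorflow",
    "python_speech_features",  # only needed for the DeepSpeech ASR model
    "numpy",
    "matplotlib",
    "scipy",
    "pandas",
    "nltk",
    "Levenshtein",
]


def missing_modules(names):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_modules(MODULES)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All listed modules were found.")
```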

To run the models

Running configuration

python gen_adv_exp.py  --objective <objective>
                       --objective1 <objective1>
                       --guided_coverage <guided_coverage>
                       --exp <exp>

where:

objective: the modules used for testing, one of {0, 1, 2}, referring to the pure adversary search module, the pure coverage boosting module and the joint modules, respectively.

objective1: the search method used in the adversary search module, one of {state_diff, cost, diff}, referring to the RNN-Test, FGSM-based and DLFuzz-based methods, respectively.

guided_coverage: the coverage metric used in the coverage boosting module, one of {hidden_state, cell_state, DX}, referring to hidden state coverage, cell state coverage and neuron coverage, respectively.

exp: whether to run adversarial testing or random testing, one of {adv, baseline}, referring to the adversarial testing methods and the random testing method, respectively.
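
The four options and their allowed values can be summarized as an argparse parser. This is a minimal sketch for illustration only and is not the parser used by gen_adv_exp.py: the function name build_parser is hypothetical, and making every option required (rather than giving it a default) is an assumption.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the documented options (not the repository's code)."""
    parser = argparse.ArgumentParser(description="Sketch of the gen_adv_exp.py options")
    # 0: pure adversary search, 1: pure coverage boosting, 2: joint modules
    parser.add_argument("--objective", type=int, choices=[0, 1, 2], required=True)
    # state_diff: RNN-Test, cost: FGSM-based, diff: DLFuzz-based
    parser.add_argument("--objective1", choices=["state_diff", "cost", "diff"], required=True)
    # hidden state coverage, cell state coverage, neuron coverage (DX)
    parser.add_argument("--guided_coverage", choices=["hidden_state", "cell_state", "DX"], required=True)
    # adv: adversarial testing, baseline: random testing
    parser.add_argument("--exp", choices=["adv", "baseline"], required=True)
    return parser


args = build_parser().parse_args(
    ["--objective", "2", "--objective1", "cost", "--guided_coverage", "cell_state", "--exp", "adv"]
)
print(args.objective, args.objective1, args.guided_coverage, args.exp)
```

With choices set, argparse rejects any value outside the documented sets and exits with a usage message.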

Example commands to run

For PTB language model:

 python gen_adv_exp.py --data_path=./testdata/ --objective 2 --objective1 state_diff --guided_coverage hidden_state --exp adv

For spell checker model:

 python gen_adv_exp.py --objective 2 --objective1 cost --guided_coverage cell_state --exp adv

For DeepSpeech ASR model:

The model file is not included in this repository and must be extracted first. The steps are the same as in https://github.com/carlini/audio_adversarial_examples and are listed below:

 cd src/audio
 git clone https://github.com/mozilla/DeepSpeech.git
 (cd DeepSpeech; git checkout tags/v0.1.1)
 wget https://github.com/mozilla/DeepSpeech/releases/download/v0.1.0/deepspeech-0.1.0-models.tar.gz
 tar -xzf deepspeech-0.1.0-models.tar.gz
 python3 make_checkpoint.py

Then run the model:

 python gen_adv_exp.py --objective 2 --objective1 state_diff --guided_coverage cell_state --exp adv
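
To compare search methods and coverage metrics, the same command can be generated for every combination of objective1 and guided_coverage. The helper below is a sketch under stated assumptions and is not part of the repository: build_commands is a hypothetical name, it only builds the command strings without running them, and it fixes objective to 2 (joint modules) and exp to adv. For the PTB language model, --data_path=./testdata/ would also need to be appended, as in the example above.

```python
import itertools

# Allowed values taken from the running configuration described above.
SEARCH_METHODS = ["state_diff", "cost", "diff"]
COVERAGE_METRICS = ["hidden_state", "cell_state", "DX"]


def build_commands(objective=2, exp="adv"):
    """Build one command line per (search method, coverage metric) pair."""
    template = (
        "python gen_adv_exp.py --objective {} --objective1 {} "
        "--guided_coverage {} --exp {}"
    )
    return [
        template.format(objective, method, metric, exp)
        for method, metric in itertools.product(SEARCH_METHODS, COVERAGE_METRICS)
    ]


commands = build_commands()
for command in commands:
    print(command)
```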
