junit-eval: A simple JUnit evaluation tool for batch processing of multiple implementations. Suitable for automated grading of assignments.
- Running multiple test classes on a large number of submissions
- Counting and reporting the number of successful tests and test cases
- Reporting source compilation errors and test compilation errors
- Support for concurrent processing of multiple files
- Experimental, optional feature: submission analysis and comments using the OpenAI GPT API
Caution
No internal sandboxing or security features are currently implemented. Make sure you trust the source code you are trying to evaluate or run it in a sandboxed environment (e.g. virtual machine).
Step 1: Install the package using pip
pip install git+https://github.com/lojzezust/junit-eval.git
Step 2: Prepare the tests and the configuration file for your project
- Place all the unit tests (and any dependencies) in one directory, all resources in another, and all required libraries (.jar files) in a third. Your project might look something like this:
.
├── tests/
│   ├── Test01.java
│   └── ...
├── submissions/
│   ├── Name1=email2=...=Main.java
│   └── ...
├── res/
│   └── viri/
│       ├── input1.txt
│       └── ...
├── lib/
│   ├── hamcrest-core-1.3.jar
│   └── junit-4.13.2.jar
└── gpt-instructions.txt
- Create a configuration YAML file. You may use configs/example.yaml as a template (a sketch of a full file follows this list). The most important fields:
  - SUBMISSIONS_DIR: root directory of your submissions.
  - CLASS_NAME: main class name of the submission. Used for renaming submission files to the correct filename, matching the class name.
  - UNIT_TESTS_DIR: location of your Java JUnit test files.
  - RESOURCES_DIR: location of the resources directory (res in the above example). The Java program will resolve paths relative to this directory (e.g. viri/input1.txt).
  - LIBS_DIR: libraries required by the tests and/or submissions. These are used during every compilation and execution.
  - JAVA_COMPILE_CMD and JAVA_RUN_CMD: locations of the javac and java binaries to use. If the binaries are on the system path, you can simply set these to javac and java.
  - OUTPUT.SUMMARY: enables/disables summaries in the reports (number of points for each test, etc.).
  - OUTPUT.DETAILED: enables/disables detailed reports for each of the test files (including error and assertion messages).
  - GPT_GRADER: configuration for the optional GPT analysis tool. You need an OpenAI API key to use this feature.
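Putting these together, a minimal configuration for the example project above might look roughly like this. This is only a sketch: the values are illustrative and the exact structure of the OUTPUT and GPT_GRADER sections may differ, so treat configs/example.yaml as the authoritative template.

SUBMISSIONS_DIR: submissions
CLASS_NAME: Main
UNIT_TESTS_DIR: tests
RESOURCES_DIR: res
LIBS_DIR: lib
JAVA_COMPILE_CMD: javac
JAVA_RUN_CMD: java
OUTPUT:
  SUMMARY: true
  DETAILED: true
GPT_GRADER:
  API_KEY: your-openai-api-key        # only needed for the optional GPT analysis
  INSTRUCTION_FILE: gpt-instructions.txt
  MAX_RETRY_TIME: 300                 # seconds; illustrative value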
Step 3: Run the evaluator
junit-eval junit config.yaml
Tip
Use the --num-workers argument to enable parallel processing.
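For example, to evaluate submissions with several parallel workers (the worker count here is illustrative):

junit-eval junit config.yaml --num-workers 4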
If you wish to modify the code, clone and install the package in editable mode:
git clone https://github.com/lojzezust/junit-eval.git
cd junit-eval
pip install -e .
For each of the submissions in the submissions directory, the following steps are performed:
- A temporary directory is created. The following files are copied to the temp location:
  - Submission source code
  - Contents of the tests directory
  - Contents of the resources directory
- The source file is compiled using javac. If compilation is not successful, the process ends and the compilation error is reported.
- Find all test classes (files in the tests directory containing "Test" in the name). For each test class:
  - Compile the test class against the assignment. In case of compilation failure, the test counts as failed and the compilation error is reported. Note: unimplemented methods/classes will cause compilation errors. It is thus recommended to separate test files into unit-sized chunks that each test a single aspect of the assignment (a sketch of such a test class follows this list).
  - Run the test cases and report the number of passed tests and the encountered errors.
- Output the report for the assignment.
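For illustration, a unit-sized JUnit 4 test class placed in the tests directory might look like the sketch below. The class under test (Main) and its add method are hypothetical; your tests must match the actual assignment interface.

import org.junit.Test;
import static org.junit.Assert.assertEquals;

// Hypothetical example: exercises a single aspect (the add method) of the submission class Main.
public class Test01 {
    @Test
    public void addReturnsSum() {
        assertEquals(4, Main.add(2, 2));
    }
}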
To use the optional GPT analysis feature you need an OpenAI API key.
- Enter your API key in the GPT_GRADER.API_KEY field.
- Prepare instructions for grading in a file and link to it in the GPT_GRADER.INSTRUCTION_FILE field. The instructions might look something like this:
Score the provided JAVA program using the following criteria. You may use half points. Keep the comments short.
Criteria:
Ex. 1:
(1p) Methods X and Y are implemented and perform Z
...
- Run the junit-eval gpt command. GPT analysis results will be stored in the output directory. Since OpenAI imposes per-minute rate limits, individual requests are retried with exponential backoff for up to GPT_GRADER.MAX_RETRY_TIME seconds; due to the rate limits, the GPT analysis may take some time.
junit-eval gpt config.yaml
Warning
Depending on the length of the instructions and submissions, and on the selected model, token use may be large. It is always recommended to first test on a small sample to preview the expected costs and the quality of the responses.
We also provide a utility for merging comments from different sources (CSV files and per-submission unit/GPT comments). You may use the following command to append comments from unit tests and GPT analysis to base CSV results:
junit-eval merge config.yaml --output-file output_results.csv --csv-base base_results.csv
The command looks through the output directory defined in the config file, adds the comments from any result files it finds (JUnit and GPT) to the base CSV file, and saves the result to a new output file.
This is a quickly implemented project with a few caveats and limitations:
- No security features are currently implemented.
- Currently limited to single-file programs/submissions.
- Some features are hard-coded to our specific use case. Feel free to adapt the code if you need to.
  - This includes the expectation that submissions are named in a specific way, as exported by Moodle (name=email=*.java). The script extracts the email field, which is stored in the final CSV report.