Madoka

A fuzzing harness for adversarial testing of Google's Magika AI-based file classifier.

Introduction

Defensive security tools that analyze malware are often dependent on static analyzers and file classifiers to establish baseline information about a sample. Google released Magika, a performant ML based file classifier that has an advertised 99% accuracy rating, and has already been integrated into various analysis stacks. But this, and other file classifiers are potentially vulnerable to errors as well as polyglots-- files that can be correctly classified as more than one type.

Compounding this problem is the gap between what a specification states is a valid file, and what real-world systems will interpret as a valid file.

To this end, Madoka is a barebones harness to use file mutation and fuzzing to produce output files that are incorrectly classified by Magika (or other static signatures) yet are still interpreted as valid by a target process.

Prerequisites

Honggfuzz
Magika

How to deploy the ELF fuzzer/polyglot finder

Included with Magika is an ELF fuzzer that mutates a known crashing binary. If we can create a variant of this file that is not classified as an ELF, but still crashes, we've succeeded in evading the classifier!

In the test directory, run make to compile the example binaries.
Copy crasher to a directory called input
(Optional) Use a script in the util directory to create a RAMdisk
Copy the checkout to a directory where you're going to execute the fuzzing cases
Run ./inputgen.sh
Run ./checker.py with the output directory to find if any of these files successfully evade Magika

Methodology

F 5BDA undamentally we need to:

Generate a corpus of mutated ELF executable files
Load Magika
Find which files aren't identified as executable
Verify which of these files still crash

Building your own fuzz harness

The seed input examples that are included with Madoka are based on known crashing files, and the validation step is to simply verify that they still crash. However, there is the possibility of adding your own targets and fuzz scenarios by overriding the Verify with your own callable, and adding seed inputs to mutate it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Madoka

Introduction

Prerequisites

How to deploy the ELF fuzzer/polyglot finder

Methodology

Building your own fuzz harness

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
input		input
test		test
util		util
README.md		README.md
checker.py		checker.py
inputgen.sh		inputgen.sh
requirements.txt		requirements.txt

desperadosec/madoka

Folders and files

Latest commit

History

Repository files navigation

Madoka

Introduction

Prerequisites

How to deploy the ELF fuzzer/polyglot finder

Methodology

Building your own fuzz harness

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages