This repository contains the code and data for the paper "The Havoc Paradox in Generator-Based Fuzzing".
- CPU: >= 1 core
- Memory: >= 16 GB
- Disk: >= 50 GB
The aggregated data is available in the `data/aggregated` folder. The data is organized in the following structure:
```
.
├── campaign_trials_detail.csv
├── corpus_sizes.csv
├── coverage.csv
├── mutation_distances.csv
├── short_runtime_campaigns.txt
└── technique_benchmark_summary.csv
```
You can use this data to reproduce the results in the paper directly. See Visualize the Results.
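For a quick look at the aggregated data, a minimal sketch using pandas is shown below; it makes no assumptions about the files' columns and simply prints what each one contains:

```python
# Minimal sketch: load the aggregated CSVs for inspection.
# No column names are assumed; inspect the printed headers to
# see the actual schema of each file.
from pathlib import Path

import pandas as pd

DATA_DIR = Path("data/aggregated")

for name in ("coverage.csv", "mutation_distances.csv",
             "technique_benchmark_summary.csv"):
    df = pd.read_csv(DATA_DIR / name)
    print(name, "->", df.shape)
    print(df.head(), "\n")
```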
To analyze the pre-baked raw results, you first need to download the data from FigShare and unzip it into the `data/raw` folder:
```
cd ./data/raw
wget ...
unzip pre-baked.zip
cd ../..
```
Next, you can use the provided scripts to generate the aggregated data. See Post-process the Results.
We provide a Docker image that includes all the required dependencies and automatically builds the fuzzers. To use it:
- Build the Docker image:

  ```
  docker build -t havoc-mutation-eval .
  ```

  Or, if you want to use the pre-built image, you can pull it from Docker Hub:

  ```
  docker pull leeleo3x/havoc-mutation-eval
  docker tag leeleo3x/havoc-mutation-eval havoc-mutation-eval
  ```
Alternatively, you can build the fuzzers manually. This requires:

- Python: >= 3.10
- Java: == 11
- Maven: >= 3.8.6
To build the fuzzers, run the following command:

```
cd fuzzers
./setup.sh
```
To verify that you have successfully compiled all fuzzers, you may run a small campaign for each fuzzer.
For example, to run a small campaign for the `ei` fuzzer with the `closure` target, run the following command:

```
docker run -v $(pwd)/data:/havoc-mutation-eval/data havoc-mutation-eval single ei closure /havoc-mutation-eval/data/raw/ei-closure-single-run PT5M
```
Or, without Docker:

```
cd fuzzers/zeugma
mvn -pl :zeugma-evaluation-tools meringue:fuzz meringue:analyze \
    -Pei,closure,log-mutation \
    -Dmeringue.outputDirectory=../../../../data/raw/ei-closure-single-run \
    -Dmeringue.duration=PT5M
```
Note
Duration is specified in the ISO-8601 duration format, e.g., `PT1H` for 1 hour and `PT5M` for 5 minutes.
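If you script your campaign lengths, a small sketch for producing such strings is shown here; `minutes_to_iso8601` is a hypothetical helper for illustration, not part of this repository:

```python
# Hypothetical helper (not part of this repository): format a campaign
# length in minutes as an ISO-8601 duration string, as accepted by
# -Dmeringue.duration.
def minutes_to_iso8601(minutes: int) -> str:
    hours, mins = divmod(minutes, 60)
    parts = ["PT"]
    if hours:
        parts.append(f"{hours}H")
    if mins or not hours:
        parts.append(f"{mins}M")
    return "".join(parts)

assert minutes_to_iso8601(5) == "PT5M"
assert minutes_to_iso8601(60) == "PT1H"
assert minutes_to_iso8601(90) == "PT1H30M"
```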
Then you can check the output in the `data/raw/ei-closure-single-run` directory:
```
.
├── campaign
│   ├── corpus        ---> Contains the raw byte stream of each saved input.
│   ├── coverage_hash
│   ├── failures
│   ├── fuzz.log      ---> Contains logs of the fuzzing process.
│   ├── mutation.log  ---> Contains logs of the mutation process (mutation distance).
│   └── plot_data
├── coverage.csv      ---> Raw coverage data.
├── failures.json
├── jacoco
│   └── jacoco.csv
└── summary.json      ---> Contains a summary of the fuzzing results.
```
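To sanity-check a finished run, a sketch along these lines may help; it assumes the layout above, and since the exact fields of summary.json are not documented here, it only prints the top-level keys:

```python
# Sketch: sanity-check one campaign's output directory. Assumes the
# layout shown above; summary.json's exact fields are not documented
# here, so only its top-level keys are printed.
import json
from pathlib import Path

run_dir = Path("data/raw/ei-closure-single-run")

summary = json.loads((run_dir / "summary.json").read_text())
print("summary keys:", sorted(summary))

corpus = run_dir / "campaign" / "corpus"
print("saved inputs:", sum(1 for _ in corpus.iterdir()))
```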
We provide a script to run all evaluations. It accepts the following options:

```
options:
  -h, --help            show this help message and exit
  --time TIME           Running time in minutes
  --cpus CPUS           Number of instances running in parallel
  --rep REP             Number of repetitions
  --log-mutation LOG_MUTATION
                        Log mutation distance of each technique
```

To run all evaluations with Docker:

```
docker run -v $(pwd)/data:/havoc-mutation-eval/data havoc-mutation-eval run --time 5 --cpus 5 --rep 1 --log-mutation true
```

Or, without Docker:

```
cd fuzzers
python3 ./run.py --time 5 --cpus 5 --rep 1 --log-mutation true
```
This script runs all the fuzzers in parallel, using the number of CPUs specified by the `--cpus` argument (default: 1). It repeats each campaign the number of times specified by the `--rep` argument (default: 1). If the `--log-mutation` argument is set to `true` (default: `false`), the script enables mutation distance logging for each technique.
The results will be saved in the `data/raw/fresh-baked` folder. Each campaign is saved in a separate folder named `{target}-{fuzzer}-results-{id}`.
Note
Each instance takes ~4-6 GB of memory, so you may need to adjust the number of instances running in parallel according to your machine's memory. The default value is 1.
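A rough way to choose `--cpus` from your machine's resources, budgeting for the worst case of the ~4-6 GB per instance noted above; `max_parallel_instances` is a hypothetical helper, not part of this repository:

```python
# Hypothetical helper: pick --cpus from available resources, budgeting
# ~6 GB of memory per instance (the worst case in the note above).
import os

def max_parallel_instances(total_memory_gb: float,
                           gb_per_instance: float = 6.0) -> int:
    by_memory = int(total_memory_gb // gb_per_instance)
    by_cpu = os.cpu_count() or 1
    return max(1, min(by_memory, by_cpu))

# e.g., a 32 GB machine is memory-bound at 5 instances:
print(max_parallel_instances(32))
```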
Once you have finished all campaigns, you can post-process the results. First, create a virtual environment and install the required packages:
```
python3 -m venv .venv
source .venv/bin/activate
pip3 install -r requirements.txt
```
Then, run the following commands to extract the results from the raw data:
- To extract the coverage data:

  ```
  python3 ./scripts/extract.py PATH_TO_RAW_DATA PATH_TO_AGGREGATED_DATA
  ```

- To extract the mutation distance data:

  ```
  python3 ./scripts/extract_mutation_data.py PATH_TO_RAW_DATA PATH_TO_AGGREGATED_DATA
  ```
For example, to extract the coverage data from the pre-baked data, run:

```
python3 ./scripts/extract.py ./data/raw/pre-baked/24h-no-mutation-distance ./data/aggregated/pre-baked
```

To extract the mutation distance data, run:

```
python3 ./scripts/extract_mutation_data.py ./data/raw/pre-baked/1h-with-mutation-distance ./data/aggregated/pre-baked
```
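To post-process a fresh-baked run with both scripts in one go, a sketch is shown below; the paths assume the default layout described above, and the mutation-distance step only yields data if `--log-mutation` was enabled:

```python
# Sketch: run both extraction scripts over a fresh-baked run.
# Paths assume the default layout described above; the mutation-distance
# extraction only produces data if --log-mutation was enabled.
import subprocess
import sys

RAW = "./data/raw/fresh-baked"
OUT = "./data/aggregated/fresh-baked"

for script in ("./scripts/extract.py", "./scripts/extract_mutation_data.py"):
    subprocess.run([sys.executable, script, RAW, OUT], check=True)
```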
You may open `notebooks/Final Results.ipynb` to visualize the results. Remember to change `DATA_DIR` to `../data/aggregated/fresh-baked` in the notebook if you want to analyze fresh-baked data.
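Outside the notebook, a minimal plotting sketch follows; the column names `time` and `covered_branches` are illustrative assumptions only, so adjust them to the actual schema of coverage.csv:

```python
# Sketch: plot coverage over time from the aggregated data.
# NOTE: "time" and "covered_branches" are assumed column names used
# purely for illustration; check coverage.csv for the actual schema.
import matplotlib.pyplot as plt
import pandas as pd

df = pd.read_csv("data/aggregated/fresh-baked/coverage.csv")
df.plot(x="time", y="covered_branches")
plt.xlabel("time")
plt.ylabel("covered branches")
plt.show()
```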
Note
If the campaign runs for too short a time, it may not produce enough data to visualize. You can run the campaign longer, e.g., 5 minutes, and then run the post-processing script again.