The Salute Project

Automatic data preparation for machine learning

The need

In Big Data processing most of the time is spent in preparing the data ready for use by advanced machine learning tools or humans building reports. This work means that insights remain locked away for too long.

The answer

Salute is able to process any type of file (Text, Image, Video, Audio, etc) and generate an output file with all the features created ready to be loaded into a machine learning or reporting tool

The technology

Salute is based on Spark and is able to process huge files.

Running Salute

The best way to run Salute is:

<spark_home>/bin/spark-submit target/salute-0.1-SNAPSHOT.jar <input_file> <output_dir>

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dependency-reduced-pom.xml		dependency-reduced-pom.xml
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

The Salute Project

The need

The answer

The technology

Running Salute

About

U 4867 h oh!

Releases

Packages

Uh oh!

Languages

License

rtjarvis/salute

Folders and files

Latest commit

History

Repository files navigation

The Salute Project

The need

The answer

The technology

Running Salute

About

Resources

License

U 4867 h oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages