https://github.com/cupertank implemented essential data structures presented in src/model, src/parser, src/tests and src/util
The following instructions were tested on Ubuntu 18.04.4 LTS.
Prior to cloning the repository and attempting to build the project, ensure that you have the following software:
- GNU g++ compiler, version 10+
- CMake, version 3.13+
- Boost library, version 1.72.0+
To use test datasets you will need:
- Git Large File Storage, version 3.0.2+
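The version requirements above can be checked before building. The following is a minimal sketch, not part of the repository: the helper names version_ge and check_tool are hypothetical, and the script assumes a `sort` that supports `-V` (GNU coreutils, as shipped with Ubuntu). It does not check Boost, because where Boost lives depends on how it was installed.

```shell
#!/bin/sh
# version_ge A B: succeeds when version A is at least version B.
# Relies on `sort -V` (version sort) putting the smaller version first.
version_ge() {
    [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# check_tool LABEL FOUND_VERSION MINIMUM: print a one-line verdict.
# An empty FOUND_VERSION means the tool could not be queried.
check_tool() {
    if [ -z "$2" ]; then
        echo "$1: not found (need $3+)"
    elif version_ge "$2" "$3"; then
        echo "$1: $2 (ok, need $3+)"
    else
        echo "$1: $2 (too old, need $3+)"
    fi
}

# Minimum versions are the ones listed in this README.
check_tool "g++" "$(g++ -dumpfullversion -dumpversion 2>/dev/null)" 10
check_tool "CMake" "$(cmake --version 2>/dev/null | awk 'NR==1 {print $3}')" 3.13
check_tool "Git LFS" "$(git lfs version 2>/dev/null | sed -n 's|^git-lfs/\([0-9.]*\).*|\1|p')" 3.0.2
```

A tool that is missing from PATH is reported as "not found" rather than stopping the script, so all three checks always run.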
Firstly, navigate to a desired directory. Then, clone the repository, cd into the project directory and launch the build script:
git clone https://github.com/Mstrutov/Desbordante/
cd Desbordante
./build.sh
The script generates the following file structure in /path/to/Desbordante/build/target:
├───inputData
│ └───some-sample-csv's.csv
├───Desbordante_test
├───Desbordante_run
The inputData directory contains several .csv files that may be used by Desbordante_test. Run Desbordante_test to perform unit testing:
cd build/target
./Desbordante_test
The tool itself is launched via the following line:
./Desbordante_run --algo=tane --data=<dataset_name>.csv
The <dataset_name>.csv, which is a user-provided dataset, should be placed in the /path/to/Desbordante/build/target directory.
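Placing the dataset and building the run command can be wrapped in a small helper. This is only a sketch: stage_dataset is a hypothetical function that does not exist in the repository, and it prints the command instead of executing it, so nothing runs until you have checked the line.

```shell
# stage_dataset CSV [TARGET_DIR]
# Hypothetical helper: copies a dataset next to the binaries and prints
# the run command from this README. TARGET_DIR defaults to build/target,
# i.e. the helper is assumed to be called from the repository root.
stage_dataset() {
    csv="$1"
    target="${2:-build/target}"
    if [ ! -f "$csv" ]; then
        echo "no such file: $csv" >&2
        return 1
    fi
    cp -- "$csv" "$target/" || return 1
    echo "./Desbordante_run --algo=tane --data=$(basename -- "$csv")"
}
```

For example, calling `stage_dataset /some/dir/my_table.csv` (a made-up path) from the repository root copies the file into build/target and prints the matching `./Desbordante_run` line, which can then be run from build/target.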
ATTENTION
Windows is currently not supported. The last compatible version is tagged windows-compatible; later versions might compile, but the likelihood of that declines sharply with each commit.
You can use the following guide to check out, build and run this particular version.
The following instructions were tested on Windows 10.
Prior to cloning the repository and attempting to build the project, ensure that you have the following software:
- Microsoft Visual Studio 2019
- CMake, version 3.13+
- Boost library, version 1.65.1+
The recommended way to install Boost is with Chocolatey.
Firstly, launch the command prompt and navigate to a desired directory. Then, clone the repository, cd into the project directory and launch the build script:
git clone https://github.com/Mstrutov/Desbordante/
cd Desbordante
git checkout windows-compatible
build.bat
Note: to compile the project, the script uses a hard-coded path to the MSVC developer command prompt, which is located by default at C:\Program Files (x86)\Microsoft Visual Studio\2019\Community\Common7\Tools\VsDevCmd.bat.
Change the path in the script if yours differs from the default.
The script generates the following file structure in \path\to\Desbordante\build\target:
├───inputData
│ └───some-sample-csv's.csv
├───fdtester_test.exe
├───fdtester_run.exe
The inputData directory contains several .csv files that may be used by fdtester_test. Run fdtester_test to perform unit testing:
cd build\target
fdtester_test.exe
The tool itself is launched via the following line:
fdtester_run.exe --algo=tane --data=<dataset_name>.csv
The <dataset_name>.csv, which is a user-provided dataset, should be placed in the \path\to\Desbordante\build\target directory.
Arthur Saliou — Fdep implementation
Eduard Gaisin — Dep-Miner implementation
Alexandr Smirnov — DFD implementation
Ilya Shchuckin — FD_Mine implementation
Michael Polyntsov — FastFDs implementation
Ilya Vologin — core classes
Maxim Strutovsky — team lead, Pyro & TANE implementation
Nikita Bobrov — product owner, consult, papers
Kirill Smirnov — product owner, code quality, infrastructure, consult
George Chernishev — product owner, consult, papers
If you use this software for research, please cite the paper (https://fruct.org/publications/fruct29/files/Strut.pdf, https://ieeexplore.ieee.org/document/9435469) as follows:
M. Strutovskiy, N. Bobrov, K. Smirnov and G. Chernishev, "Desbordante: a Framework for Exploring Limits of Dependency Discovery Algorithms," 2021 29th Conference of Open Innovations Association (FRUCT), 2021, pp. 344-354, doi: 10.23919/FRUCT52173.2021.9435469.