BOP_3D_visualizer

GUI application for visualizing the BOP challenge 6D object pose estimation results.

Features:

RGB point cloud visualization if depth is provided.
Ground truth object poses visualization if it is provided.
Predictions poses visualization, support for multiple methods comparison.
2D image projection of the predicted object poses into the image and possibility to save the them.
Color selection for the method predictions by the user.
Camera pose visualization - basic camera frame visualization as OpenCV coordinate system.

Be mindfull that the application is still in development and some features might not work as expected. Also the application shows the whole split, so if showcasing the result for example LM-O datset, the application does not consirder the test_targets_bop19.json or test_targets_bop24.json file, which is used for the evaluation of the results in BOP challenge, wher only certain images are used (3, 7, ...) and showcases results on all of the images.

Exported visualization example


Inference Image	3D view
First Method Contour Highlight	First Method Mask Overlay
Second Method Contour Highlight	Second Method Mask Overlay

Data format

The application expects the format same as the BOP challenge for 6D pose estimation, that is a csv file named METHOD_DATASET-test.csv with the following columns:

scene_id -> The scene id of the dataset.
im_id -> The image id of the image in the scene.
obj_id -> The object id of the object in the image.
score -> The score of the object detection(Not used in this application).
R -> 3x3 rotation matrix whose elements are saved row-wise and separated by a white space (i.e. r11 r12 r13 r21 r22 r23 r31 r32 r33, where rij is an element from the i-th row and the j-th column of the matrix).
t -> x1 translation vector (in mm) whose elements are separated by a white space (i.e. t1 t2 t3).
time -> Inference time of all objects poses for the whole image (Not used in this application).

Usage

Clone the repository.
Install the required packages:

    TODO: Open3d pandas numpy opencv-python and some others

Setup the config path as seen in the example_config.json file.

split_scene_path -> Path to the dataset folder and its split.
models_path -> Path to the 3D models of the objects whic are used for the visualization.
csv_paths -> List of the paths to the csv files with the predictions for possible comparison of multiple methods.
saving_path -> Path to the folder where the images with the 2D projections will be saved.

Run the application:

    python main.py -c config/example_config.json

Will call 2 subprocesses for the GUI/3D visualization and the 2D visualization service.

Use the GUI to navigate through the images and customize the visualization. Use left and righ mouse button to rotate and translate the camera in the 3D view. Use the scroll wheel to zoom in and out.

TODO: Features to add

Add the switching between rgb and gray. This is required due to some of the datasets being in grayscale.
Add config loading in the main.py and its usege in the application.
Add the camera pose visualization.
Make more robust for xyzibd where multiple cameras are present in the scene.
Finish the documentation and type hints.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
config		config
images		images
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

BOP_3D_visualizer

Exported visualization example

Data format

Usage

TODO: Features to add

About

Uh oh!

Releases

Packages

Languages

License

vitzeman/BOP_3D_visualizer

Folders and files

Latest commit

History

Repository files navigation

BOP_3D_visualizer

Exported visualization example

Data format

Usage

TODO: Features to add

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages