8000 GitHub - IBCNServices/lab-kappa-solution: Solution for the hands-on lab about the Kappa architecture and Spark streaming.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
This repository was archived by the owner on Sep 18, 2024. It is now read-only.

Solution for the hands-on lab about the Kappa architecture and Spark streaming.

License

Notifications You must be signed in to change notification settings

IBCNServices/lab-kappa-solution

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

78 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Big Data hands-on - Kappa Architecture - Solution

Getting started

Note: Please follow the installation docs for the Kappa course first.

  1. Clone this repository to your computer using git.

  2. Open lab-kappa.code-workspace using Visual Studio Code.

  3. Choose "Yes, I trust the authors", so VSCode enables regular mode.

    workspace-trust-dialog

  4. Click on the "Remote" icon on the bottom left of VSCode.

    remote-button

    1. Choose the option "Reopen in container" from the command palette.

      reopen-in-container

    2. The container will be automatically created based on the configuration included in the lab files.

  5. Wait until the container is set up. This can take a few minutes because the container needs to be pulled and built. You can check the progress by clicking "Starting Dev Container (show log)" in the notification on the bottom right of VSCode.

  6. When the containers are setup, open introduction.ipynb and follow the instructions there. When asked about which kernel to use, always choose the Python kernel base (Python 3.9.7) /opt/conda/bin/python.

Appendix

Reset environment and remove databases

If you encounter uncommon issues with Spark or connectivity issues to Kafka, it might help to remove all containers from your system, restart vscode and choose "open folder in container" again.

For users on Linux and macOS, run the following command in order to remove ALL containers from your system.

docker stop $(docker ps -aq) && docker rm $(docker ps -aq)

Windows users can do this from the Docker GUI or run the following commands in PowerShell:

docker stop $(docker ps -aq)
docker rm $(docker ps -aq)

Copyright

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © teaching staff of the Big Data hands-on course at UGent Academie voor Ingenieurs (UGain) at the Faculty of Engineering and Architecture - Ghent University.

About

Solution for the hands-on lab about the Kappa architecture and Spark streaming.

Resources

License

Stars

Watchers

Forks

Packages

No packages published
0