cassandra_etl

A simple etl pipeline using Apache Cassandra

Getting Started

Prerequisites

Running this project will require

python 3
jupyter lab
docker

Running a Cassandra server

Get the cassandra docker image docker pull cassandra:3.11.4

Create the docker container, running dettached, providing an available memory limit, and exposing the appropriate port docker run --memory 4g --name cass-serv -p 9042:9042 -d cassandra:3.11.4

This creates and runs the container the first time, each additional time you need to start the container, you can use the docker start command: docker start cass-serv

and stop the container using docker stop cass-serv.

Download Dependencies and Start Jupyter Notebook

Run pip install -r requirements.txt to download libraries required in the jupyter notebook.

To start the notebook. Run jupyter lab, a browser should start in your current directory and you should be able to interact with the cassandra server running in the docker container.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
event_data		event_data
images		images
.gitignore		.gitignore
LICENSE		LICENSE
Project_1B_ Project_Template.ipynb		Project_1B_ Project_Template.ipynb
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

cassandra_etl

Getting Started

Prerequisites

Running a Cassandra server

Download Dependencies and Start Jupyter Notebook

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

jtbricker/cassandra_etl

Folders and files

Latest commit

History

Repository files navigation

cassandra_etl

Getting Started

Prerequisites

Running a Cassandra server

Download Dependencies and Start Jupyter Notebook

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages