8000 cloudera-hadoop · GitHub Topics · GitHub

More Web Proxy on the site http://driver.im/

#

cloudera-hadoop

Here are 38 public repositories matching this topic...

sergevs / ansible-cloudera-hadoop

ansible playbook to deploy cloudera hadoop components to the cluster

kafka impala hbase hadoop-cluster oozie cloudera-hadoop

Updated Sep 8, 2018
Shell

tilakpatidar / cdh5

Docker image for Cloudera Hadoop components (CDH5)

mysql docker hive docker-compose postgresql zookeeper hdfs cloudera-hadoop

Updated Jan 2, 2018
Shell

smartlin5228 / CCA175

scala spark cloudera sparksql cloudera-hadoop

Updated Oct 31, 2017
Java

Ranjandas / Dirty-CDH-Docker

A quick and dirty CDH cluster skeleton using Docker for Testing

docker cloudera cdh cloudera-hadoop

Updated Aug 13, 2016
Shell

dengshaochun / cdh-tools

cloudera hadoop auto install

ansible cloudera-hadoop auto-install

Updated Jun 13, 2018
Shell

haspdecrypted / OS-for-Big-Data-and-Hadoop

Getting Started with Hadoop and Big Data

spark hadoop bigdata cloudera cloudera-hadoop

Updated Jul 24, 2021

rapsoulhaonan / graphic-theoretic-problems

💂‍♂️ Hadoop/MapReduce Streaming

python virtualbox hadoop-mapreduce cloudera-hadoop

Updated Sep 14, 2017
Python

kwartile / spark-benchmark

Spark Benchmark suite to evaluate cluster configuration and compare the performance with other big data frameworks.

benchmark performance scala spark apache-spark hive hadoop impala benchmarking-suite cdh cloudera-hadoop

Updated May 26, 2017
Scala

kumar-de / BD2017

Otto-von-Guericke Universität Magdeburg - Big Data SoSe 2017

java bigdata cluster-computing cloudera-hadoop ovgu

Updated Apr 11, 2018
Java

arunkthomasuncc / Query_Search_Using_TF-IDF

This repository contains the TF-IDF score calculation for the documents in the Canterbury dataset for a user given search query

java hadoop tfidf hadoop-mapreduce cloudera-hadoop

Updated Oct 9, 2018
Java

SakhriHoussem / Apache-Hive-Tutorial

Learn How Hive Work in Simple Example

hive cloudera cloudera-hadoop

Updated Jul 11, 2018

vodkolav / DataEngineerProject

This is my final project for Data Engineer Expert course at Naya College.

twitter kafka spark hive hadoop jupyter-notebook python3 hdfs cloudera-hadoop spark-structured-streaming

Updated Jan 19, 2020
Jupyter Notebook

Ishuan / Page-Rank-Implementation

The goal of this programming assignment is to compute the PageRanks of an input set of hyperlinked Wikipedia documents using Hadoop MapReduce. The PageRank score of a web page serves as an indicator of the importance of the page. Many web search engines (e.g., Google) use PageRank scores in some form to rank user-submitted queries. The goals of …

java cloud-computing hadoop-mapreduce cloudera-hadoop mapreduce-java

Updated Apr 22, 2018
Java

JohnnyFoulds / local-hadoop

This project creates a small local Hadoop cluster using Cloudera CDH and CentOS.

hadoop vmware-esxi centos cloudera powercli cloudera-hadoop vmware-vsphere

Updated Jul 12, 2020
Python

dorianbg / cloudera-quickstart-installation-guide

How to install Cloudera quickstart

big-data hadoop cloudera oozie hue cloudera-hadoop

Updated Aug 17, 2017

syscrest / cloudera-manager-hipchat-chatbot

chatbot for hipchat (cloud or onpremise) that enables you to talk to your cloudera manager

devops chatops hadoop communication chatbot hipchat cdh cloudera-hadoop cloudera-manager

Updated Apr 21, 2017
Java

nikitaeverywhere / hadoop-network-of-keywords

Keywords network builder based on TF-IDF with the use of Hadoop platform

hadoop cloudera term-frequency document-frequency tf-idf mapreduce cloudera-hadoop hadoop-platform keywords-builder

Updated Dec 17, 2017
Python

jcrespoortega / Docker-Twitter-Sentiment-analysis

docker twitter mongodb sentiment-analysis map-reduce mrjob cloudera-hadoop

Updated Jun 11, 2018
Python

AdrianYuu / qualification-big-data-processing

A qualification project for teaching as an assistant at SLC in the COMP6579001 Big Data Processing course.

jupyter-notebook pyspark cloudera-hadoop

Updated Feb 15, 2024
Jupyter Notebook

Rifat392000 / BigDataAnalytics

visualization sql clustering eclipse virtual-machine python3 rdbms hue hadoop-filesystem hadoop-mapreduce cloudera-hadoop pyspark-notebook big-data-analytics java-mapreduce big-data-processing google-colab-notebook

Updated Sep 10, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the cloudera-hadoop topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the cloudera-hadoop topic, visit your repo's landing page and select "manage topics."

0