Stars
Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.
Vertica Hadoop Connector
Elephant Twin LZO uses Elephant Twin to create LZO block indexes
Elephant Twin is a framework for creating indexes in Hadoop
A platform for visualization and real-time monitoring of data workflows
A port of LINQ (Language-Integrated Query) to Java
dvryaboy / hadoop-lzo
Forked from twitter/hadoop-lzo
Patched, refactored version of code.google.com/hadoop-gpl-compression for Hadoop 0.20
dvryaboy / PigEditor
Forked from romainr/PigEditor
Eclipse plugin for Apache Pig
dvryaboy / bud
Forked from bloom-lang/bud
Prototype Bud runtime (Bloom Under Development)
Twitter common libraries for Python and the JVM (deprecated)
A repository of user-defined functions for Apache Pig
Use JNI to implement process spawning without fork()
Common metadata layer for Hadoop's MapReduce, Pig, and Hive
dvryaboy / flume
Forked from cloudera/flume
Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming d…
dvryaboy / scribe
Forked from traviscrawford/scribe
Scribe is a server for aggregating log data streamed in real time from a large number of servers. It is designed to be scalable, extensible without client-side modification, and robust to failure o…
dvryaboy / elephant-bird
Forked from twitter/elephant-bird
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, and HBase code.
Indexed HBase. An extension of HBase core that supports faster scans at the expense of larger RAM consumption.
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.