Apache Pig
===========
Pig is a dataflow programming environment for processing very large files. Pig's
language is called Pig Latin. A Pig Latin program consists of a directed
acyclic graph in which each node represents an operation that transforms data.
Operations come in two flavors: (1) relational-algebra style operations such as
join, filter, and project; (2) functional-programming style operators such as
map and reduce.

Pig compiles these dataflow programs into (sequences of) map-reduce jobs and
executes them using Hadoop. It is also possible to execute Pig Latin programs
in a "local" mode (without a Hadoop cluster), in which case all processing takes
place in a single local JVM.
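
As a minimal sketch of such a dataflow, the shell snippet below writes a small
Pig Latin script (load, filter, project) and runs it in local mode. The file
names, the sample data, and the relation names are made up for illustration,
and it assumes the "pig" launcher is on the PATH; if it is not, the script is
only written to disk.

```shell
# Work in a throwaway directory so nothing in the current tree is touched.
workdir=$(mktemp -d)

# Tab-separated sample input (made-up data, for illustration only).
printf 'alice\t72\nbob\t41\ncarol\t90\n' > "$workdir/scores.txt"

# A three-node dataflow: LOAD -> FILTER -> FOREACH (projection).
cat > "$workdir/filter_scores.pig" <<'EOF'
records = LOAD 'scores.txt' USING PigStorage('\t') AS (name:chararray, score:int);
passing = FILTER records BY score > 50;
names   = FOREACH passing GENERATE name;
DUMP names;
EOF

# Run in local mode (single JVM, no cluster) if the pig launcher is available.
if command -v pig >/dev/null 2>&1; then
  (cd "$workdir" && pig -x local filter_scores.pig) || echo "pig run failed" >&2
else
  echo "pig not found on PATH; script written to $workdir/filter_scores.pig"
fi
```

Each assignment in the script is one node of the graph, and DUMP is what
triggers execution. With the sample data above, the run should print the names
whose score is above 50.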

General Info
===============

For the latest information about Pig, please visit our website at:

   http://pig.apache.org/

and our wiki, at:

   http://wiki.apache.org/pig/

Getting Started
===============
1. To learn about Pig, try http://wiki.apache.org/pig/PigTutorial
2. To build and run Pig, try http://wiki.apache.org/pig/BuildPig and
   http://wiki.apache.org/pig/RunPig
3. To check out the function library, try http://wiki.apache.org/pig/PiggyBank

Build and run Pig on Power
==========================

To build Pig, run "ant jar". By default, Pig builds against Hadoop1, which
here refers to 1.0.4. To build against Hadoop2 (2.0.3), run
"ant jar -Dhadoopversion=23".

To run the unit tests, run "ant test".
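
The choice between the two build targets can be wrapped in a small shell
helper. This is only a sketch: the function name "pig_build_cmd" and its
"1"/"2" argument are hypothetical and not part of the Pig build; the ant
commands it prints are the ones given above.

```shell
# Hypothetical helper: print the ant command for a given Hadoop line.
#   1 -> default build (Hadoop1, here 1.0.4)
#   2 -> Hadoop2 build (2.0.3), selected with -Dhadoopversion=23
pig_build_cmd() {
  case "${1:-1}" in
    1) echo "ant jar" ;;
    2) echo "ant jar -Dhadoopversion=23" ;;
    *) echo "unknown Hadoop line: $1" >&2; return 1 ;;
  esac
}

pig_build_cmd 1   # prints: ant jar
pig_build_cmd 2   # prints: ant jar -Dhadoopversion=23
```

To actually build rather than print, the output can be executed from the
source root, for example with: $(pig_build_cmd 2)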

Contributing to the Project
===========================

We welcome all contributions. For details, please visit
http://wiki.apache.org/pig/HowToContribute.
