8000 GitHub - dgleich/hadoopy: Python MapReduce library written in Cython
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

dgleich/hadoopy

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Brandyn White <bwhite@dappervision.com>

Source  https://github.com/bwhite/hadoopy/
Issues  https://github.com/bwhite/hadoopy/issues
Docs    http://bwhite.github.com/hadoopy/

Requirements
cython (>=.13), cxfreeze (for freeze support)

Features
- oozie support
- typedbytes support (very fast)
- critical path is in Cython
- works on clusters without any extra installation, Python, or any Python libraries (requires cx_Freeze)
- Simple HDFS access (cat and ls) inside Python, even inside running jobs
- Unit test interface
- Reporting using status and counters
- Supports design patterns in the Lin/Dyer book (http://www.umiacs.umd.edu/~jimmylin/book.html)

Used in
- A Case for Query by Image and Text Content: Searching Computer Help using Screenshots and Keywords (to appear in WWW'11)
- Web-Scale Computer Vision using MapReduce for Multimedia Data Mining (at KDD'10)
- Vitrieve: Visual Search engine

Install
sudo python setup.py install

About

Python MapReduce library written in Cython

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 94.7%
  • C 5.3%
0