Stars
Dapp learning project for developers at all stages. Becoming and cultivating sovereign individuals. Nonprofit organization.
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
Apache Spark - A unified analytics engine for large-scale data processing
A curated list of data engineering tools for software developers
My blogs and code for machine learning. http://cnblogs.com/pinard
Source Code for the book Building Machine Learning Systems with Python
some notes about the basic knowledge of machine learning and data mining
fastutil extends the Java™ Collections Framework by providing type-specific maps, sets, lists and queues.
VIP cheatsheets for Stanford's CS 229 Machine Learning
Apache RocketMQ is a cloud native messaging and streaming platform, making it simple to build event-driven applications.
A curated list of awesome Amazon Web Services (AWS) libraries, open source repos, guides, blogs, and other resources. Featuring the Fiery Meter of AWSome.
Spark: The Definitive Guide's Code Repository
A Flexible and Powerful Parameter Server for large-scale machine learning
Alibaba Java Coding Guidelines pmd implements and IDE plugin
A sample project that exists for PyPUG's "Tutorial on Packaging and Distributing Projects"
A collection of example UDFs for Amazon Redshift.
Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment
Simple and extensible administrative interface framework for Flask
python ip proxy tool scrapy crawl. 抓取大量免费代理 ip,提取有效 ip 使用
The official home of the Presto distributed SQL query engine for big data