Stars
A compiled list of kaggle competitions and their winning solutions for regression problems.
Neologism dictionary based on the language resources on the Web for mecab-ipadic
LINE: Large-scale information network embedding
A Face detector for anime/manga using OpenCV
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Code for Large Scale Hierarchical Text Classification competition. Final place: 3rd
GraphChi's C++ version. Big Data - small machine.
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
KDD CUP 2013 - Track 1 - 2nd place model
Tracking Dynamics of Topic Trends Using a Finite Mixture Model (KDD 2004)
Code to create benchmarks for Kaggle's Facebook Recruiting Competition
MongoDBの薄い本(The Little MongoDB Book)