Stars
参考《自己动手写 docker》从零开始实现一个简易的 docker 以及相关教程。Build a simple Docker from scratch along with related tutorials.
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
(MLSys' 21) An Acceleration System for Large-scare Unsupervised Heterogeneous Outlier Detection (Anomaly Detection)
程序员延寿指南 | A programmer's guide to live longer
Distribution transparent Machine Learning experiments on Apache Spark
SKAB - Skoltech Anomaly Benchmark. Time-series data for evaluating Anomaly Detection algorithms.
A simple Spark-powered ETL framework that just works 🍺
A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scalable training and ONNX export for easy cross-platform infere…
A curated list of awesome Apache Spark packages and resources.
TensorFlow for macOS 11.0+ accelerated using Apple's ML Compute framework.
C++ 资源大全中文版,标准库、Web应用框架、人工智能、数据库、图片处理、机器学习、日志、代码分析等。由「开源前哨」和「CPP开发者」微信公号团队维护更新。
搞定C++:punch:。C++ Primer 中文版第5版学习仓库,包括笔记和课后练习答案。
Cross-platform GUI management tools for OpenCore(OCAT)
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Now we have become very big, Different from the original idea. Collect premium software in various categories.
🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
wudashuo / yolov5
Forked from ultralytics/yolov5YOLOv5 汉化版,保持官方同步更新
Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow, PyTorch, etc) to enable distributed deep learning training and inference on a Flink cluster.
Python 开源项目之「自学编程之路」,保姆级教程:AI实验室、宝藏视频、数据结构、学习指南、机器学习实战、深度学习实战、网络爬虫、大厂面经、程序人生、资源分享。
定期更新Hadoop生态圈中常用大数据组件文档 重心依次为: Flink Solr Sparksql ES Scala Kafka Hbase/phoenix Redis Kerberos (项目包含hadoop思维导图 印象笔记 Scala版本简单demo 常用工具类 去敏后的train code 持续更新!!!)
基于 PyFlink 的学习文档,通过一个个小实践,便于大家快速入手 PyFlink
🚀🚀🚀优质的历史文章,大数据高频考点,Java一线大厂知识考点,更有精美简历模板,简历指导手册和上百本技术书籍,最重要的就是被全网下载上千次的我自己花精力去画的大数据生态圈,Kafka,Spark,Scala的思维导图...这是一个你在大数据学习路上不能错过的宝藏项目!
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.