-
velox Public
Forked from oap-project/velox
A new C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.
-
CS-Notes Public
Forked from CyC2018/CS-Notes
📚 Essential fundamentals for technical interviews: Leetcode, computer operating systems, computer networks, system design
Updated Aug 21, 2024
-
async-profiler Public
Forked from async-profiler/async-profiler
Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events
C++ Apache License 2.0 Updated Jul 2, 2024
-
arrow-datafusion-comet Public
Forked from apache/datafusion-comet
Apache Arrow DataFusion Comet Spark Accelerator
Rust Apache License 2.0 Updated Apr 8, 2024
-
substrait Public
Forked from substrait-io/substrait
A cross-platform way to express data transformation, relational algebra, standardized record expression and plans.
HTML Apache License 2.0 Updated Aug 18, 2023
-
IntelQATCodec Public
Forked from Intel-bigdata/IntelQATCodec
Java Apache License 2.0 Updated Aug 8, 2023
-
oneDAL Public
Forked from uxlfoundation/oneDAL
oneAPI Data Analytics Library (oneDAL)
C++ Apache License 2.0 Updated Jun 7, 2023
-
spark Public
Forked from apache/spark
Mirror of Apache Spark
Scala Apache License 2.0 Updated Apr 26, 2023
-
arrow-1 Public
Forked from oap-project/arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for effic…
C++ Apache License 2.0 Updated Mar 22, 2023
-
presto Public
Forked from prestodb/presto
The official home of the Presto distributed SQL query engine for big data
Java Apache License 2.0 Updated Nov 10, 2022
-
native-sql-engine Public
Forked from oap-project/gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
Scala Apache License 2.0 Updated Oct 21, 2021
-
tensorflow Public
Forked from tensorflow/tensorflow
An Open Source Machine Learning Framework for Everyone
C++ Apache License 2.0 Updated Sep 14, 2021
-
petastorm Public
Forked from uber/petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch…
Python Apache License 2.0 Updated Sep 4, 2021
-
spark-nlp Public
Forked from JohnSnowLabs/spark-nlp
State of the Art Natural Language Processing
Scala Apache License 2.0 Updated Jul 22, 2021
-
raydp Public
Forked from oap-project/raydp
RayDP: Distributed data processing library that provides simple APIs for running Spark on Ray and integrating Spark with distributed deep learning and machine learning frameworks.
Python Apache License 2.0 Updated Jul 20, 2021
-
BigDL Public
Forked from intel/ipex-llm
BigDL: Distributed Deep Learning Framework for Apache Spark
Scala Apache License 2.0 Updated Jul 8, 2021
-
ecosystem Public
Forked from tensorflow/ecosystem
Integration of TensorFlow with other open-source frameworks
Scala Apache License 2.0 Updated Jun 20, 2021
-
analytics-zoo Public
Forked from intel/BigDL
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray
Jupyter Notebook Apache License 2.0 Updated Jun 11, 2021
-
models Public
Forked from intel/ai-reference-models
Model Zoo for Intel® Architecture: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors
Python Apache License 2.0 Updated May 24, 2021
-
keras_bert_text_classification Public
Forked from percent4/keras_bert_text_classification
This project uses Keras and Keras-bert to implement a multi-class text classification task by fine-tuning BERT.
Python Updated Apr 4, 2021
-
xgboost Public
Forked from Intel-bigdata/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
C++ Apache License 2.0 Updated Feb 9, 2021
-
horovod Public
Forked from horovod/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Python Other Updated Feb 9, 2021
-
sql-ds-cache Public
Forked from oap-project/sql-ds-cache
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
Scala Apache License 2.0 Updated Feb 7, 2021
-
arrow-data-source Public
Forked from oap-project/arrow-data-source
Spark DataSource plugin for reading files from various formats like Parquet into Arrow compatible columnar vectors.
Scala Apache License 2.0 Updated Jan 7, 2021
-
spark-adaptive Public
Forked from shuangshuangwang/spark-adaptive
-
OAP Public
Forked from LinhongLiu/OAP
Optimized Analytics Package for Spark* Platform
Scala Apache License 2.0 Updated Aug 27, 2020
-
keras-bert Public
Forked from CyberZHG/keras-bert
Implementation of BERT that could load official pre-trained models for feature extraction and prediction
Python MIT License Updated Jul 28, 2020
-
horovodRunnerBenchMark_IPython Public
Forked from psychologyphd/horovodRunnerBenchMark_IPython
Same as horovodRunnerBenchMark, but an IPython version for better readability
Jupyter Notebook GNU General Public License v3.0 Updated Jun 24, 2020
-
ray Public
Forked from ray-project/ray
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning libr…
Python Apache License 2.0 Updated Apr 22, 2020