8000 SemyonSinchenko (Sem) · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View SemyonSinchenko's full-sized avatar

Organizations

@apache

Block or report SemyonSinchenko

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SemyonSinchenko/README.md

Sem Sinchenko

Data Egnineer, Open Source Software enthusiast, Apache Software Foundation committer.

I'm developing in Python, Scala/Java and some Rust. Mostly my activities are related to the Apache Spark / PySpark ecosystem and Data Engineering tools.

I'm a maintainer at the following projects:

  • GraphFrames -- scalabale graph algorithms on top of Apache Spark DataFrames.
  • Apache GraphAr (incubating) -- universal "open-table" format for storing Property Graphs.
  • spark-fast-tests -- Apache Spark testing helpers and assertions (Scala).
  • chispa -- Apache Spark testing helpers and assertions (Python).
  • falsa -- CLI tool for generating datasets of the H2O benchmark. Wriiten in Rust.

And other various projects.

sbt               1 hr 23 mins    █████████████████████▒░░░   85.93 %
YAML              6 mins          █▓░░░░░░░░░░░░░░░░░░░░░░░   06.58 %
Python            3 mins          █░░░░░░░░░░░░░░░░░░░░░░░░   04.03 %
TOML              3 mins          ▓░░░░░░░░░░░░░░░░░░░░░░░░   03.27 %
Java Properties   0 secs          ░░░░░░░░░░░░░░░░░░░░░░░░░   00.20 %

Semyon's GitHub stats

About any open source activities and / or collaborations you can reach me using ssinchenko@apache.org.

About any other activities and / or collaborations you can reach me using my private email ssinchenko@pm.me.

Pinned Loading

  1. ibisgraph ibisgraph Public

    An implementation of Pregel framework and graph algorithms on top of it with Ibis project DataFrames.

    Python 23

  2. apache/incubator-graphar apache/incubator-graphar Public

    An open source, standard data file format for graph data storage and retrieval.

    C++ 281 71

  3. flake8-pyspark-with-column flake8-pyspark-with-column Public

    A flake8 plugin that detects of usage withColumn in a loop or inside reduce

    Python 28 1

  4. graphframes/graphframes graphframes/graphframes Public

    GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs

    Scala 1.1k 250

  5. apache/datafusion-comet apache/datafusion-comet Public

    Apache DataFusion Comet Spark Accelerator

    Rust 980 220

  6. feature-generation-benchmark feature-generation-benchmark Public

    A database-like benchmark of feature generation from time-series data

    Jupyter Notebook 13 1

0