8000 tpvasconcelos's list / 👨‍🔬 Data Science · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View tpvasconcelos's full-sized avatar
👨‍🏭
building...
👨‍🏭
building...

Block or report tpvasconcelos

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

👨‍🔬 Data Science

37 repositories

Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM

Python 84 14 Updated Jan 12, 2024

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…

Python 10,395 952 Updated Jun 27, 2025

A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton

Python 859 36 Updated Jul 3, 2023

Awesome list of open-source startup alternatives to well-known SaaS products 🚀

Python 17,791 951 Updated Dec 26, 2024

PySpark test helper methods with beautiful error messages

Python 699 72 Updated Jun 10, 2025

Streamlit — A faster way to build and share data apps.

Python 40,120 3,526 Updated Jun 27, 2025

Collections of vector search related libraries, service and research papers

1,506 103 Updated Aug 6, 2024

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 32,627 3,068 Updated Jun 27, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 38,768 2,955 Updated Jun 27, 2025

Survival analysis in Python

Python 2,457 560 Updated Oct 29, 2024

aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

Jupyter Notebook 27,541 7,926 Updated Jun 25, 2024

Create powerful Hydra applications without the yaml files and boilerplate code.

Python 387 17 Updated Jun 16, 2025

Hydra is a framework for elegantly configuring complex applications

Python 9,437 688 Updated May 15, 2025

Tigramite is a python package for causal inference with a focus on time series data. The Tigramite documentation is at

Jupyter Notebook 1,480 292 Updated Dec 20, 2024

ML powered analytics engine for outlier detection and root cause analysis.

Python 758 85 Updated Sep 12, 2024

Uniform Manifold Approximation and Projection

Python 7,826 843 Updated May 12, 2025

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Rust 24,342 1,666 Updated Jun 27, 2025

DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphic…

Python 7,559 971 Updated Jun 24, 2025

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

Python 2,247 283 Updated Jun 20, 2025

A light-weight, flexible, and expressive statistical data testing library

Python 3,872 346 Updated Jun 26, 2025

Automatically visualize your pandas dataframe via a single print! 📊 💡

Python 5,280 375 Updated Mar 20, 2024

Extract data from a wide range of Internet sources into a pandas DataFrame.

Python 3,051 679 Updated Apr 3, 2025

World beating online covariance and portfolio construction.

Jupyter Notebook 302 51 Updated Jun 16, 2025

Open-Source Information Retrieval Courses @ TU Wien

Python 614 87 Updated Jun 12, 2023

Google's Operations Research tools:

C++ 12,123 2,234 Updated Jun 26, 2025

Visual Pandas Selector: Visualize and interactively select time-series data

Python 82 5 Updated Jan 24, 2025

A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. All in a modern, AI-native editor.

Python 14,040 575 Updated Jun 27, 2025

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

R 6,775 269 Updated Dec 10, 2024

Automatic exploratory data analysis

Python 3 Updated Feb 20, 2024

Synthetic data generation for tabular data

Python 3,040 366 Updated Jun 26, 2025
0