8000 okasag (Gabriel Okasa) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View okasag's full-sized avatar
:octocat:
and not a single loop was given
:octocat:
and not a single loop was given

Block or report okasag

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Model implementation for the contextual embeddings project

Python 27 Updated Jun 2, 2025

Replication code and results for: A general framework to quantify the event importance in multi-event contests.

R 1 Updated Apr 15, 2025

DoubleML - Double Machine Learning in R

R 146 26 Updated Apr 14, 2025

Interrupted Time Series for Causal Estimation

HTML 2 Updated Oct 7, 2024

Efficient few-shot learning with Sentence Transformers

Jupyter Notebook 2,492 244 Updated Apr 11, 2025

Code for measuring novelty in science using publication text

Jupyter Notebook 27 7 Updated Mar 4, 2025

Code to replicate the simulation study in the paper "Causal Machine Learning for Moderation Analysis".

Python 2 Updated Dec 17, 2024

A reading list for papers on causality for natural language processing (NLP)

646 69 Updated May 29, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 42,086 6,016 Updated Jun 4, 2025

Code to replicate the simulation study and empirical application in the paper "Improving the Finite Sample Performance of Double/Debiased Machine Learning with Propensity Score Calibration"

Python 5 2 Updated Dec 17, 2024

Code for "Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification", arXiv 2024

Jupyter Notebook 10 4 Updated Jun 24, 2024

BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them to BERT, intermediate results are pooled. The implementation …

Python 142 32 Updated Jun 19, 2024

Quarto document on using tidymodels and Databricks for predicting lending rates.

R 6 4 Updated Jun 24, 2024

Tensorflow 2 implementation of Causal-BERT

Python 71 19 Updated Nov 5, 2023

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 10,957 1,628 Updated May 26, 2025

Fuzzy string matching, grouping, and evaluation.

Python 764 72 Updated May 6, 2025

A blazing fast inference solution for text embeddings models

Rust 3,646 266 Updated Jun 4, 2025

🎯 Task-oriented embedding tuning for BERT, CLIP, etc.

Python 1,500 69 Updated Mar 11, 2024

The multilingual language model for Switzerland

Jupyter Notebook 26 4 Updated Jan 19, 2024

A curated list of pretrained sentence and word embedding models

Python 2,255 262 Updated Apr 23, 2021

Code to replicate the simulation study in the paper "Calibrating doubly-robust estimators with unbalanced treatment assignment"

Python 13 3 Updated May 20, 2024

Model Confidence Set (MCS) implementation in Python

Python 10 Updated Oct 21, 2024

Uncertainty Toolbox: a Python toolbox for predictive uncertainty quantification, calibration, metrics, and visualization

Python 1,895 138 Updated Mar 5, 2025

General purpose unsupervised sentence representations

C++ 1,204 261 Updated Aug 3, 2022

Python implementation of TextRank algorithm for automatic keyword extraction and summarization using Levenshtein distance as relation between text units. This project is based on the paper "TextRan…

Python 780 224 Updated May 5, 2022

A python tool for evaluating the quality of sentence embeddings.

Python 2,108 307 Updated Mar 19, 2024
Next
0