8000 brunovilar (Bruno Vilar) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View brunovilar's full-sized avatar

Block or report brunovilar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

JavaScript 6,312 1,343 Updated Jun 4, 2025

Revisiting Pretrarining Objectives for Tabular Deep Learning

Python 63 10 Updated Aug 22, 2022

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and mo…

Python 3,819 271 Updated Jun 1, 2025

🎓 Um caminho para a educação autodidata em Ciência da Computação!

15,398 1,187 Updated Mar 21, 2025

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Jupyter Notebook 2,037 208 Updated Jan 9, 2024

Example repo to kickstart integration with mlflow pipelines.

Python 76 63 Updated Nov 14, 2022
TypeScript 832 74 Updated Jul 16, 2024

JupyterLite demo deployed to GitHub Pages 🚀

Jupyter Notebook 390 222 Updated Jun 5, 2025
Jupyter Notebook 347 96 Updated Aug 8, 2024

The data factory for next gen AI

Python 138 67 Updated Jun 11, 2025

The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020

Jupyter Notebook 601 67 Updated Jun 4, 2020

Dict2vec is a framework to learn word embeddings using lexical dictionaries.

Python 114 30 Updated Jan 8, 2021

My PhD thesis with all its source files, including all .tex files and images created, as well as the slides of my defense.

TeX 4 1 Updated Nov 9, 2020

📖 A curated list of resources dedicated to Natural Language Processing (NLP)

17,225 2,608 Updated Nov 13, 2023

Benchmarks of approximate nearest neighbor libraries in Python

Python 5,305 811 Updated Jun 10, 2025

Curated repository of notes from papers I'm reading, mostly NLP related. Updated regularly.

128 29 Updated Apr 19, 2021

Compute Sentence Embeddings Fast!

Jupyter Notebook 623 84 Updated Mar 2, 2023

sentence embedding by Smooth Inverse Frequency weighting scheme

Python 1,088 308 Updated Jul 23, 2019

A collection of modern/faster/saner alternatives to common unix commands.

32,014 803 Updated Sep 10, 2024

A pure python implementation of the Word Mover‘s Embedding Algorithm

Python 6 1 Updated Apr 24, 2021

WordMoversEmbeddings(WME) is a simple code for generating the vector representation of sentence/document for text classification and clustering.

C 81 15 Updated Dec 5, 2018

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

28,015 3,767 Updated Jul 18, 2024
Jupyter Notebook 57 11 Updated May 14, 2024

Best Practices on Recommendation Systems

Python 20,354 3,218 Updated Jun 10, 2025

Roadmap to becoming a data engineer in 2021

12,642 1,350 Updated Jan 25, 2022

Self-Supervised Euphemism Detection and Identification for Content Moderation, IEEE S&P (Oakland) 2021

Python 33 10 Updated Mar 26, 2025

PyTorch implementation for "Matching the Blanks: Distributional Similarity for Relation Learning" paper

Python 595 134 Updated Sep 24, 2023

This Universal Dependencies (UD) Portuguese treebank.

Common Lisp 50 12 Updated Jun 1, 2025

SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks

Java 29 5 Updated Mar 12, 2024
Python 98 19 Updated Feb 25, 2022
Next
0