8000 gengala (gennaro gala | gg) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View gengala's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report gengala

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OLMo-core ported for Snellius

Python 2 Updated Jul 10, 2025

Code accompanying the paper "Generalized Interpolating Discrete Diffusion"

Python 93 13 Updated Jun 9, 2025

Minimalistic large language model 3D-parallelism training

Python 2,021 205 Updated Jul 10, 2025

TransMLA: Multi-Head Latent Attention Is All You Need

Python 327 22 Updated Jul 4, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 58,925 8,211 Updated Jul 13, 2025

Efficient Triton Kernels for LLM Training

Python 5,362 369 Updated Jul 14, 2025

PyTorch building blocks for the OLMo ecosystem

Python 260 50 Updated Jul 14, 2025

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 12,321 1,817 Updated Aug 8, 2024

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 146,467 12,378 Updated Jul 12, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,780 353 Updated Jul 13, 2025

An extension of the nanoGPT repository for training small MOE models.

Python 162 20 Updated Mar 9, 2025

Stanford Drone Dataset with non-convex Constraints

Jupyter Notebook 6 Updated Apr 18, 2025
Python 6 Updated Feb 3, 2025

Sum-of-squares Non-monotonic Probabilistic Circuits

Python 7 Updated Jan 16, 2025
Python 3 Updated Feb 3, 2025

Tensor Network Learning with PyTorch

Python 298 42 Updated May 23, 2024

A computer algebra system written in pure Python

Python 13,739 4,720 Updated Jul 11, 2025

Code for "TabZilla: When Do Neural Nets Outperform Boosted Trees on Tabular Data?"

HTML 162 35 Updated Mar 22, 2024

Official implementation of E(n)-equivariant Graph Neural Cellular Automata

Jupyter Notebook 29 4 Updated Apr 25, 2024

A New Modeling Framework for Continuous, Sequential Domains

Jupyter Notebook 2 1 Updated Jun 16, 2024

Code release for Hoogeboom, Emiel, Jorn WT Peters, Rianne van den Berg, and Max Welling. "Integer Discrete Flows and Lossless Compression." Conference on Neural Information Processing Systems (2019).

Python 98 15 Updated Nov 29, 2019

LLM training in simple, raw C/CUDA

Cuda 27,145 3,123 Updated Jun 26, 2025

Squared Non-monotonic Probabilistic Circuits

Python 22 Updated Jan 16, 2025

Vector (and Scalar) Quantization, in Pytorch

Python 3,406 272 Updated Jun 16, 2025

Tabular Deep Learning Library for PyTorch

Python 682 69 Updated Jul 7, 2025

Solve puzzles. Learn CUDA.

Jupyter Notebook 11,267 855 Updated Sep 1, 2024

a python framework to build, learn and reason about probabilistic circuits and tensor networks

Python 111 17 Updated Jul 12, 2025

Pytorch implementation of Block Neural Autoregressive Flow

Python 179 33 Updated Aug 19, 2021
Next
0