8000 johncalesp (John Calderon) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View johncalesp's full-sized avatar
  • Toronto,CA

Block or report johncalesp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 567 105 Updated Jul 12, 2025

Learn LLVM 17, published by Packt

C++ 197 38 Updated May 28, 2024

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 12,318 1,817 Updated Aug 8, 2024

Introduction to Machine Learning Systems

TeX 2,011 234 Updated Jul 14, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 7,964 1,319 Updated Jul 6, 2025

Video+code lecture on building nanoGPT from scratch

Python 4,215 641 Updated Aug 13, 2024

GPU programming related news and material links

1,616 90 Updated Jan 6, 2025

Awesome LLM compression research papers and tools.

1,596 102 Updated Jul 2, 2025

Learn CUDA Programming, published by Packt

Cuda 1,165 257 Updated Dec 30, 2023

CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. The authors introduce each area of CUDA development through w…

C 428 136 Updated Jun 30, 2023

📰 Must-read papers and blogs on Speculative Decoding ⚡️

826 44 Updated Jun 22, 2025

Accelerate inference without tears

Python 319 21 Updated Mar 14, 2025

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,260 74 Updated Mar 6, 2025

LLVM Tutorial: Kaleidoscope (Implementing a Language with LLVM)

C++ 263 50 Updated Dec 29, 2022

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 42,814 7,169 Updated Dec 9, 2024

LLM training in simple, raw C/CUDA

Cuda 27,140 3,123 Updated Jun 26, 2025

Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)

Cuda 815 293 Updated Aug 19, 2024

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 8,577 664 Updated Aug 18, 2024

Fast CUDA matrix multiplication from scratch

Cuda 768 119 Updated Dec 28, 2023

Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch

Cuda 845 177 Updated Jul 19, 2023

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,446 3,620 Updated Jul 14, 2025

header-only Windows implementation of the <sys/time.h> header

C 12 3 Updated Mar 22, 2023

A Cross-Browser, Event-based, Element Resize Detection for React

TypeScript 1,283 95 Updated Jun 24, 2025

This a classification model using tensorflow and Keras. The dataset is from kaggle.

Python 2 Updated Aug 26, 2022
0