OxxoCodes (Nathan Brown) / Starred · GitHub

Flash-Muon: An Efficient Implementation of Muon Optimizer

Python 137 10 Updated Jun 15, 2025

NVIDIA Linux open GPU with P2P support

C 1,184 116 Updated Jun 6, 2025

A Survey on Data Selection for Language Models

240 15 Updated Apr 29, 2025

Train VAE like a boss

Jupyter Notebook 282 12 Updated Oct 21, 2024

A reading list on LLM based Synthetic Data Generation 🔥

1,330 77 Updated Jun 5, 2025

Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"

Python 305 25 Updated Dec 20, 2023
TypeScript 25,705 1,747 Updated Jul 5, 2025

A framework for few-shot evaluation of language models.

Python 9,468 2,516 Updated Jul 7, 2025

Run a GPT model in the browser with WebGPU. An implementation of GPT inference in fewer than ~1500 lines of vanilla JavaScript.

JavaScript 3,714 210 Updated Jan 12, 2024

All-in-one text de-duplication

Python 697 74 Updated May 25, 2025

Retro device nc2000/nc2600 emulator (6502 CPU). Emulator for the Wenquxing (文曲星) nc2000/nc2600.

C++ 19 2 Updated Jul 7, 2025

🕸 GlotCC Dataset and Pipeline -- NeurIPS 2024

Jupyter Notebook 19 Updated Apr 6, 2025

MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning

Python 93 5 Updated Aug 15, 2023

A list of microgrant programs for your good ideas

1,589 75 Updated May 9, 2025

A pure NumPy implementation of Mamba.

Python 225 8 Updated Jul 8, 2024

The data set contains cabinet statements from the South African government. Data was scraped from the government's website: https://www.gov.za/cabinet-statements

Jupyter Notebook 4 Updated Jul 4, 2025

MAFAND-MT

Jupyter Notebook 57 27 Updated Jul 9, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 58,407 8,134 Updated Jul 6, 2025

Python scraper based on AI

Python 20,199 1,724 Updated Jul 3, 2025

Devon: An open-source pair programmer

Python 3,436 282 Updated May 26, 2025

A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment…

Python 1,031 95 Updated Jun 30, 2025

Benchmarking PDF libraries

Python 292 16 Updated Jul 2, 2025

Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines

Python 197 12 Updated May 6, 2024

A playbook for systematically maximizing the performance of deep learning models.

28,896 2,374 Updated Jun 18, 2024

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 7,530 622 Updated Jul 7, 2025

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.

Python 75 16 Updated Aug 17, 2024

A PyTorch native platform for training generative AI models

Python 4,014 420 Updated Jul 7, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 32,893 3,464 Updated Apr 19, 2025

The original sources of MS-DOS 1.25, 2.0, and 4.0 for reference purposes

Assembly 31,286 4,470 Updated Apr 25, 2024

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,761 1,189 Updated Jul 3, 2025