8000 itsnamgyu (Namgyu Ho) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View itsnamgyu's full-sized avatar
🌝
Excited
🌝
Excited

Block or report itsnamgyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,866 193 Updated Aug 17, 2024

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 2,475 260 Updated Aug 13, 2024

A PyTorch native platform for training generative AI models

Python 3,834 377 Updated May 24, 2025

XAttention: Block Sparse Attention with Antidiagonal Scoring

Python 156 7 Updated May 13, 2025

Gemma open-weight LLM library, from Google DeepMind

Jupyter Notebook 3,306 450 Updated May 23, 2025

JAX-based neural network library

Python 3,033 245 Updated May 1, 2025

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 496 68 Updated May 23, 2025

An interactive HTML pretty-printer for machine learning research in IPython notebooks.

Python 415 23 Updated May 1, 2025

Modular, scalable library to train ML models

Python 118 14 Updated May 23, 2025

Kanana: Compute-efficient Bilingual Language Models

239 9 Updated May 23, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,551 254 Updated May 20, 2025

s1: Simple test-time scaling

Python 6,392 745 Updated May 19, 2025

Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models

Python 30 2 Updated Apr 22, 2025

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

417 10 Updated May 7, 2025
Jupyter Notebook 19 7 Updated Oct 12, 2024

Evaluation of speculative inference over multilingual tasks

Python 8 Updated Jul 1, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 144,750 29,078 Updated May 24, 2025

Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]

Python 141 10 Updated Oct 27, 2024

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 17,352 2,488 Updated May 22, 2025

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Python 99 1 Updated Apr 4, 2024

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 791 40 Updated Apr 30, 2025

Official repository for EXAONE built by LG AI Research

184 13 Updated Aug 8, 2024

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,493 400 Updated Jul 16, 2023

Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)

Python 216 25 Updated Mar 13, 2025

LLM101n: Let's build a Storyteller

33,493 1,826 Updated Aug 1, 2024

Official implementation of "Perturbed-Attention Guidance"

Jupyter Notebook 306 13 Updated Jul 2, 2024

Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Spotlight

Python 20 Updated Mar 7, 2024
Next
0