- KAIST AI (OSI LAB)
- Seoul, Korea
- namgyu.com
- https://orcid.org/0000-0002-2445-3026
- @itsnamgyu
Stars
Doing simple retrieval from LLMs at various context lengths to measure accuracy
Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.
A PyTorch native platform for training generative AI models
XAttention: Block Sparse Attention with Antidiagonal Scoring
Gemma open-weight LLM library, from Google DeepMind
Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…
An interactive HTML pretty-printer for machine learning research in IPython notebooks.
Modular, scalable library to train ML models
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Code for the paper "Self-Training Elicits Concise Reasoning in Large Language Models"
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
Evaluation of speculative inference over multilingual tasks
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, and RAG. We also show you how to solve end to end problems using Llama mode…
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
Official repository for EXAONE built by LG AI Research
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)
Official implementation of "Perturbed-Attention Guidance"
Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Spotlight