sahandrez (Sahand Rezaei-Shoshtari) / Starred · GitHub

Everything about the SmolLM and SmolVLM family of models

Python 2,741 172 Updated Jul 8, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,515 636 Updated Jul 2, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,265 843 Updated Jul 10, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,994 1,490 Updated Apr 24, 2025

Fully open reproduction of DeepSeek-R1

Python 25,005 2,327 Updated Jul 9, 2025

A framework for few-shot evaluation of language models.

Python 9,493 2,522 Updated Jul 10, 2025

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 1,459 86 Updated Jan 7, 2025

Modeling, training, eval, and inference code for OLMo

Python 5,756 630 Updated Jul 7, 2025

Minimalistic large language model 3D-parallelism training

Python 1,997 204 Updated Jul 7, 2025

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 11,066 1,264 Updated Jul 1, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,740 924 Updated Jul 1, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 58,603 8,163 Updated Jul 9, 2025

This repository contains LLM (large language model) interview questions asked at top companies such as Google, Nvidia, Meta, Microsoft, and Fortune 500 companies.

1,365 314 Updated Feb 12, 2025

[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)

Jupyter Notebook 4,955 642 Updated Feb 12, 2025

Solve puzzles. Improve your PyTorch.

Jupyter Notebook 3,640 329 Updated Jul 15, 2024

LeetCode for PyTorch

Jupyter Notebook 508 89 Updated Jun 7, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

57,130 6,148 Updated Jun 4, 2025

Persian raw text - about 80 GB of raw Persian text

98 9 Updated Aug 28, 2020

Persian (Farsi) Question Answering Dataset (+ Models)

Jupyter Notebook 210 17 Updated Sep 8, 2021

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 42,697 7,147 Updated Dec 9, 2024

LLM101n: Let's build a Storyteller

33,946 1,845 Updated Aug 1, 2024

Reinforcement Learning from Human Feedback (RLHF) with Unpaired Preferences

Python 2 1 Updated Jan 14, 2025

LLM inference in C/C++

C++ 82,801 12,308 Updated Jul 10, 2025

https://huyenchip.com/ml-interviews-book/

HTML 3,778 594 Updated Mar 21, 2025

Machine Learning and Computer Vision Engineer - Technical Interview Questions

3,789 618 Updated May 20, 2025

Curated list of data science interview questions and answers

4,791 1,102 Updated Sep 29, 2024

Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with comma…

TypeScript 7,490 601 Updated Jul 10, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 51,867 8,603 Updated Jul 10, 2025