- Mila, McGill University
- Montreal, Canada
- sahandrez.github.io
Stars
Everything about the SmolLM and SmolVLM family of models
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
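Fine-grained scaling means each small block of a matrix gets its own scale factor, so the narrow FP8 dynamic range is used well even when magnitudes vary across the tensor. A toy pure-Python sketch of the per-block scale computation (block size, max value, and function names here are illustrative, not DeepGEMM's actual API; DeepGEMM uses 128-wide blocks on GPU):

```python
# Toy sketch of fine-grained (per-block) scaling, the idea behind FP8 GEMM
# kernels such as DeepGEMM. Names and block size are illustrative.
FP8_MAX = 448.0   # max finite value of the common e4m3 FP8 format
BLOCK = 4         # per-block granularity (real kernels use e.g. 128)

def blockwise_scales(row):
    """One scale per BLOCK-sized chunk: amax(chunk) / FP8_MAX."""
    return [
        max(abs(v) for v in row[i:i + BLOCK]) / FP8_MAX
        for i in range(0, len(row), BLOCK)
    ]

row = [0.01, -0.02, 0.03, 0.005, 100.0, -250.0, 7.0, 1.0]
scales = blockwise_scales(row)
# the small-magnitude block gets a small scale, the large one a large scale,
# so both survive quantization to FP8 without clipping or underflow
```

With a single tensor-wide scale, the 0.005-sized values in the first block would be crushed to zero by the scale needed for the 250.0 outlier; per-block scales avoid that.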
DeepEP: an efficient expert-parallel communication library
Minimal reproduction of DeepSeek R1-Zero
Fully open reproduction of DeepSeek-R1
A framework for few-shot evaluation of language models.
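Few-shot evaluation works by prepending k solved demonstrations to each test question before querying the model. A minimal sketch of that prompt assembly (the general pattern a harness like this automates; the function and field names are illustrative, not the library's API):

```python
# Sketch of few-shot prompt construction: k solved examples, then the query.
# Field names ("question"/"answer") and formatting are illustrative.
def build_fewshot_prompt(examples, query, k=2):
    """Prepend k demonstrations to the test question."""
    shots = "\n\n".join(
        f"Q: {ex['question']}\nA: {ex['answer']}" for ex in examples[:k]
    )
    return f"{shots}\n\nQ: {query}\nA:"

demos = [
    {"question": "2 + 2 = ?", "answer": "4"},
    {"question": "3 * 3 = ?", "answer": "9"},
]
prompt = build_fewshot_prompt(demos, "5 - 1 = ?")
# prompt ends with "A:" so the model's completion is scored as the answer
```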
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
Modeling, training, eval, and inference code for OLMo
Minimalistic large language model 3D-parallelism training
Unsupervised text tokenizer for Neural Network-based text generation.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
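The BPE algorithm itself is simple: repeatedly find the most frequent adjacent pair of token ids and replace it with a newly minted id. One round of that loop, sketched in pure Python (illustrative names, not the repo's actual API):

```python
# One round of Byte Pair Encoding (BPE): count adjacent pairs, merge the
# most frequent pair into a new token id. Names are illustrative.
from collections import Counter

def most_frequent_pair(ids):
    """Count adjacent token-id pairs and return the most frequent one."""
    pairs = Counter(zip(ids, ids[1:]))
    return max(pairs, key=pairs.get)

def merge(ids, pair, new_id):
    """Replace every non-overlapping occurrence of `pair` with `new_id`."""
    out, i = [], 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

ids = list(b"aaabdaaabac")      # raw bytes as the initial token ids
pair = most_frequent_pair(ids)  # (97, 97), i.e. "aa"
ids = merge(ids, pair, 256)     # ids 0-255 are bytes; 256 is the first merge
```

A full tokenizer just repeats this until it reaches the desired vocabulary size, recording the merge order so encoding can replay it.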
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
This repository contains LLM (Large Language Model) interview questions asked at top companies like Google, Nvidia, Meta, Microsoft, and other Fortune 500 companies.
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
Solve puzzles. Improve your PyTorch.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Persian raw text - about 80 GB of raw Persian (Farsi) text
Persian (Farsi) Question Answering Dataset (+ Models)
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Reinforcement Learning from Human Feedback (RLHF) with Unpaired Preferences
https://huyenchip.com/ml-interviews-book/
Machine Learning and Computer Vision Engineer - Technical Interview Questions
Curated list of data science interview questions and answers
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with comma…
A high-throughput and memory-efficient inference and serving engine for LLMs