8000 StellaAthena (Stella Biderman) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View StellaAthena's full-sized avatar

Organizations

@EleutherAI

Block or report StellaAthena

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The nnsight package enables interpreting and manipulating the internals of deep learned models.

Jupyter Notebook 559 50 Updated May 6, 2025

Official Code for Stable Cascade

Jupyter Notebook 6,594 524 Updated Jul 25, 2024

Generative Representational Instruction Tuning

Jupyter Notebook 627 44 Updated Mar 14, 2025

The Art of Debugging

C 877 39 Updated Aug 3, 2024

Machine Learning Engineering Open Book

Python 13,635 823 Updated May 1, 2025
Python 4 2 Updated Dec 6, 2023

A framework for few-shot evaluation of language models.

Python 8,863 2,361 Updated May 6, 2025

the LLM vulnerability scanner

Python 4,399 435 Updated May 7, 2025

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,473 185 Updated May 6, 2025

Tools for understanding how transformer predictions are built layer-by-layer

Python 490 55 Updated Jun 2, 2024

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,149 1,189 Updated Jul 30, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,637 477 Updated Jan 8, 2024

Toolkit for creating, sharing and using natural language prompts.

Python 2,839 364 Updated Oct 23, 2023

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,795 1,616 Updated Feb 29, 2024

A dataset of alignment research and code to reproduce it

HTML 77 17 Updated Jun 22, 2023

A framework for few-shot evaluation of autoregressive language models.

Python 103 29 Updated May 9, 2023
Python 4 1 Updated May 4, 2022

CLOOB training (JAX) and inference (JAX and PyTorch)

Python 71 7 Updated May 16, 2022

MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multil…

Python 490 57 Updated Apr 11, 2025

Locating and editing factual associations in GPT (NeurIPS 2022)

Python 630 138 Updated Apr 20, 2024

Implementation of LogAvgExp for Pytorch

Python 35 2 Updated Apr 10, 2025

GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compression of numerical and other data types in HPC/ML applications.

Cuda 331 28 Updated Mar 20, 2025

An annotated implementation of the Transformer paper.

Jupyter Notebook 6,206 1,328 Updated Apr 7, 2024

Annotated transformer blog

2 Updated Nov 22, 2021

Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch

Python 484 87 Updated Oct 9, 2024

v objective diffusion inference code for PyTorch.

Python 716 108 Updated Nov 29, 2022

Code and explanation for IEEE CoG paper "Predicting Human Card Selection in Magic: The Gathering with Contextual Preference Ranking"

Python 5 5 Updated Dec 13, 2022

State of the Art Magic: the Gathering Draft and DeckBuilder AI.

Python 158 40 Updated Mar 30, 2024

An efficient interactive zero-knowledge proof scheme based on GKR in terms of unlayered circuit.

C++ 17 6 Updated May 23, 2023
Next
0