emergenz

Franz Srambical emergenz

agi @p-doom

13 followers · 13 following

p(doom)
Munich
10:04 (UTC +02:00)
srambical.fr
@lemergenz
in/franz-srambical-418630178

Achievements

x3 x3

Achievements

x3 x3

Highlights

pdoom.org Public

A grassroots initiative on A(G)I research disregarding dumb societal gatekeeping mechanisms.

JavaScript Apache License 2.0 Updated Jun 23, 2025
Stoix Public
Forked from EdanToledo/Stoix

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python Apache License 2.0 Updated Jun 13, 2025
jafar Public
Forked from FLAIROx/jafar

JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"

Python 1 Apache License 2.0 Updated May 15, 2025
zmk-config-miryoku Public

Updated May 6, 2025
TinyZero Public
Forked from Jiayi-Pan/TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python Apache License 2.0 Updated May 4, 2025
jax Public
Forked from jax-ml/jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python Apache License 2.0 Updated Apr 21, 2025
zmk-config-calmar-one Public
Forked from raphaelmosaic/zmk-config-calmar-one

Updated Apr 19, 2025
nano-aha-moment Public
Forked from McGill-NLP/nano-aha-moment

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook MIT License Updated Apr 17, 2025
emergenz.github.io Public

personal website

HTML Updated Apr 13, 2025
mle-scheduler Public
Forked from mle-infrastructure/mle-scheduler

Lightweight Cluster/Cloud VM Job Management 🚀

Python MIT License Updated Apr 11, 2025
tuning_playbook Public
Forked from google-research/tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

Other Updated Apr 10, 2025
submitit Public
Forked from facebookincubator/submitit

Python 3.8+ toolbox for submitting jobs to Slurm

Python MIT License Updated Apr 8, 2025
chex Public
Forked from google-deepmind/chex

Python Apache License 2.0 Updated Mar 25, 2025
scaling-book Public
Forked from jax-ml/scaling-book

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML MIT License Updated Mar 22, 2025
grain Public
Forked from google/grain

Library for reading and processing ML training data.

Python Apache License 2.0 Updated Mar 10, 2025
miryoku_zmk Public
Forked from manna-harbour/miryoku_zmk

Miryoku is an ergonomic, minimal, orthogonal, and universal keyboard layout. Miryoku ZMK is the Miryoku implementation for ZMK.

C Updated Feb 19, 2025
sway-cursor Public

A sway-native keyboard-driven cursor with pointer acceleration.

Python MIT License Updated Feb 16, 2025
nvim-config Public

Lua Apache License 2.0 Updated Jan 17, 2025
minimo Public
Forked from gpoesia/minimo

Learning Formal Mathematics from Intrinsic Motivation

Rust MIT License Updated Oct 31, 2024
mup-lr-warmup Public

We investigate the impact of learning rate warmup on GPT-style Transformers using muP/SP trained on a realistic repository (hlb-gpt) on language modeling.

Python Apache License 2.0 Updated Aug 21, 2024
DeepSeek-Prover-V1.5 Public
Forked from deepseek-ai/DeepSeek-Prover-V1.5

Python MIT License Updated Aug 16, 2024
aerospace.toml Public

Default config, but with cmd as modifier + input mode (binding mode without bindings) to circumvent clashes with OS bindings.

Updated Aug 12, 2024
hlb-gpt-mup-warmup Public
Forked from tysam-code/hlb-gpt

Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to large…

Python Apache License 2.0 Updated Jul 29, 2024
ezmup Public
Forked from cloneofsimo/ezmup

Simple implementation of muP, based on Spectral Condition for Feature Learning

Python Updated Jul 28, 2024
mup_transformer_warmup Public

Investigation of whether we can omit/ shorten lr warmup under muP.

Jupyter Notebook 1 Updated Jul 27, 2024
mup Public
Forked from microsoft/mup

maximal update parametrization (µP)

Jupyter Notebook MIT License Updated Jul 17, 2024
maxtext Public
Forked from AI-Hypercomputer/maxtext

A simple, performant and scalable Jax LLM!

Python Apache License 2.0 Updated Jul 16, 2024
transformers Public
Forked from huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python Apache License 2.0 Updated Jun 23, 2024
modded-nanogpt-mup-transformer-warmup Public
Forked from KellerJordan/modded-nanogpt

GPT-2 (124M) quality in 5B tokens. Do we need lr warmup under muP?

Python Updated Jun 19, 2024
bs-mask Public

An attention implementation that uses the causal mask, shifts the queries 'to the right', adjusts the RoPE encodings accordingly and removes the padding tokens from the output. Empirically collapse…

Python Updated May 27, 2024

Franz Srambical emergenz

Achievements

Achievements

Highlights

pdoom.org Public

Uh oh!

Stoix Public

Uh oh!

jafar Public

Uh oh!

zmk-config-miryoku Public

Uh oh!

TinyZero Public

Uh oh!

jax Public

Uh oh!

zmk-config-calmar-one Public

Uh oh!

nano-aha-moment Public

Uh oh!

emergenz.github.io Public

Uh oh!

mle-scheduler Public

Uh oh!

tuning_playbook Public

Uh oh!

submitit Public

Uh oh!

chex Public

Uh oh!

scaling-book Public

Uh oh!

grain Public

Uh oh!

miryoku_zmk Public

Uh oh!

sway-cursor Public

Uh oh!

nvim-config Public

Uh oh!

minimo Public

Uh oh!

mup-lr-warmup Public

Uh oh!

DeepSeek-Prover-V1.5 Public

Uh oh!

aerospace.toml Public

Uh oh!

hlb-gpt-mup-warmup Public

Uh oh!

ezmup Public

Uh oh!

mup_transformer_warmup Public

Uh oh!

mup Public

Uh oh!

maxtext Public

Uh oh!

transformers Public

Uh oh!

modded-nanogpt-mup-transformer-warmup Public

Uh oh!

bs-mask Public

Uh oh!