8000 bluorion.com · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
@bluorion-com

bluorion.com

Pinned Loading

  1. ZClip ZClip Public

    Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".

    Python 129 9

  2. weight_rescaling weight_rescaling Public

    Official implementation of the "Variance control via weight rescaling in LLM pretraining" paper.

    Python 5

  3. refine_massive_activations refine_massive_activations Public

    Official implementation of the paper: "A Refined Analysis of Massive Activations in LLMs".

    Python 10 3

Repositories

Showing 10 of 18 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…

0