bluorion.com
Pinned Loading
Repositories
- weight_rescaling Public
Official implementation of the "Variance control via weight rescaling in LLM pretraining" paper.
bluorion-com/weight_rescaling’s past year of commit activity - ZClip Public
Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".
bluorion-com/ZClip’s past year of commit activity - refine_massive_activations Public
Official implementation of the paper: "A Refined Analysis of Massive Activations in LLMs".
bluorion-com/refine_massive_activations’s past year of commit activity - raydp Public Forked from oap-project/raydp
RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.
bluorion-com/raydp’s past year of commit activity - Megatron-LM Public Forked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
bluorion-com/Megatron-LM’s past year of commit activity - vllm-production-stack Public Forked from vllm-project/production-stack
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
bluorion-com/vllm-production-stack’s past year of commit activity - lingua Public Forked from facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
bluorion-com/lingua’s past year of commit activity - flash-attention Public Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
bluorion-com/flash-attention’s past year of commit activity - torchx_nccl_test Public
bluorion-com/torchx_nccl_test’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…