-
-
OpenHands Public
Forked from All-Hands-AI/OpenHands🙌 OpenHands: Code Less, Make More
Python MIT License UpdatedApr 23, 2025 -
shallow-vs-deep-alignment Public
Forked from Unispac/shallow-vs-deep-alignmentOfficial Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep
Python MIT License UpdatedApr 23, 2025 -
turso-per-user-starter Public template
Forked from notrab/turso-per-user-starterDatabase per user
TypeScript UpdatedApr 10, 2025 -
AI-Scientist-v2 Public
Forked from SakanaAI/AI-Scientist-v2The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Python Apache License 2.0 UpdatedApr 8, 2025 -
MoE-Quant Public
Forked from IST-DASLab/MoE-QuantCode for data-aware compression of DeepSeek models
Python UpdatedApr 8, 2025 -
inspect_ai Public
Forked from UKGovernmentBEIS/inspect_aiInspect: A framework for large language model evaluations
Python MIT License UpdatedMar 28, 2025 -
understand-r1-zero Public
Forked from sail-sg/understand-r1-zeroUnderstanding R1-Zero-Like Training: A Critical Perspective
Python MIT License UpdatedMar 21, 2025 -
-
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedMar 17, 2025 -
chatgpt_system_prompt Public
Forked from LouisShark/chatgpt_system_promptA collection of GPT system prompts and various prompt injection/leaking knowledge.
HTML MIT License UpdatedMar 13, 2025 -
PRC-Watermark Public
Forked from XuandongZhao/PRC-Watermark[ICLR 2025] An Undetectable Watermark for Generative Image Models
Python MIT License UpdatedMar 6, 2025 -
KBLaM Public
Forked from microsoft/KBLaMOfficial Implementation of "KBLaM: Knowledge Base augmented Language Model"
Jupyter Notebook MIT License UpdatedMar 5, 2025 -
-
cognitive-behaviors Public
Forked from kanishkg/cognitive-behaviorsPython Apache License 2.0 UpdatedMar 4, 2025 -
SRM Public
Forked from Chrixtar/SRMImplementation of Spatial Reasoning with Denoising Models
Python MIT License UpdatedMar 3, 2025 -
swe-rl Public
Forked from facebookresearch/swe-rlOfficial codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
Python Other UpdatedMar 1, 2025 -
open-infra-index Public
Forked from deepseek-ai/open-infra-indexProduction-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Creative Commons Zero v1.0 Universal UpdatedMar 1, 2025 -
3FS Public
Forked from deepseek-ai/3FSA high-performance distributed file system designed to address the challenges of AI training and inference workloads.
C++ MIT License UpdatedFeb 28, 2025 -
emergent-misalignment Public
Forked from emergent-misalignment/emergent-misalignmentPython MIT License UpdatedFeb 27, 2025 -
smallpond Public
Forked from deepseek-ai/smallpondA lightweight data processing framework built on DuckDB and 3FS.
Python MIT License UpdatedFeb 27, 2025 -
-
profile-data Public
Forked from deepseek-ai/profile-dataAnalyze computation-communication overlap in V3/R1.
UpdatedFeb 27, 2025 -
EPLB Public
Forked from deepseek-ai/EPLBExpert Parallelism Load Balancer
Python MIT License UpdatedFeb 26, 2025 -
DeepGEMM Public
Forked from deepseek-ai/DeepGEMMDeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Cuda MIT License UpdatedFeb 25, 2025 -
DeepEP Public
Forked from deepseek-ai/DeepEPDeepEP: an efficient expert-parallel communication library
Cuda MIT License UpdatedFeb 25, 2025 -
PRefLexOR Public
Forked from lamm-mit/PRefLexORPreference-based Recursive Language Modeling for Exploratory Optimization of Reasoning
Jupyter Notebook Apache License 2.0 UpdatedFeb 24, 2025 -
-
open-thoughts Public
Forked from open-thoughts/open-thoughtsFully open data curation for reasoning models
Python Apache License 2.0 UpdatedFeb 23, 2025 -
s1 Public
Forked from simplescaling/s1s1: Simple test-time scaling
Python Apache License 2.0 UpdatedFeb 23, 2025