🤓
PhD student @ Princeton ECE.
-
Princeton University
- Princeton, NJ
-
08:39
(UTC -07:00) - www.boyiwei.com
- @wei_boyi
Highlights
- Pro
Pinned Loading
-
Dynamic-Risk-Assessment
Dynamic-Risk-Assessment PublicDynamic Risk Assessment for Offensive Cybersecurity Agents
Python 6
-
alignment-attribution-code
alignment-attribution-code Public[ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
-
princeton-polaris-lab/Evaluating-Durable-Safeguards
princeton-polaris-lab/Evaluating-Durable-Safeguards Public[ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.