SPY Lab · GitHub
@ethz-spylab

SPY Lab

Secure and Private AI research at ETH Zürich

SPY Lab (ETH Zurich)

The Secure and Private AI (SPY) Lab conducts research on the security, privacy and trustworthiness of machine learning systems. We often approach these problems from an adversarial perspective, by designing attacks that probe the worst-case performance of a system to ultimately understand and improve its safety.

💡 Learn more about our work and read our publications on our website.

🖥️ Check the code for our projects in this repository.

Popular repositories

  1. agentdojo (Public)

    A dynamic environment to evaluate attacks and defenses for LLM agents.

    Python · 201 stars · 43 forks

  2. rlhf_trojan_competition (Public)

    Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.

    Python · 113 stars · 9 forks

  3. rlhf-poisoning (Public)

    Code for the paper "Universal Jailbreak Backdoors from Poisoned Human Feedback".

    Python · 55 stars · 9 forks

  4. diffusion_denoised_smoothing (Public)

    Certified robustness "for free" using off-the-shelf diffusion models and classifiers.

    Python · 42 stars · 4 forks

  5. robust-style-mimicry (Public)

    Python · 39 stars · 1 fork

  6. autoadvexbench (Public)

    Python · 31 stars · 1 fork

Repositories

Showing 10 of 26 repositories
