ai-security

Whistleblower is a offensive security tool for testing against system prompt leakage and capability discovery of an AI application exposed through API. Built for AI engineers, security researchers and folks who want to know what's going on inside the LLM-based app they use daily

ai-security prompt-engineering llm-security jailbreaks prompt-injection-llm-security ai-red-teaming

Updated Jul 28, 2024
Python

reds-lab / Narcissus

Star

The official implementation of the CCS'23 paper, Narcissus clean-label backdoor attack -- only takes THREE images to poison a face recognition dataset in a clean-label way and achieves a 99.89% attack success rate.

adversarial-machine-learning adversarial-attacks ai-security backdoor-attacks deep- poisoning-attacks

Updated May 9, 2023
Python

LLAMATOR-Core / llamator

Star

Framework for testing vulnerabilities of large language models (LLM).

Updated Apr 28, 2025
Python

RjDuan / AdvDrop

Star

Code for "Adversarial attack by dropping information." (ICCV 2021)

pytorch adversarial-examples adversarial-attacks ai-security

Updated Jan 13, 2022
Python

jay-johnson / train-ai-with-django-swagger-jwt

Star

Train AI (Keras + Tensorflow) to defend apps with Django REST Framework + Celery + Swagger + JWT - deploys to Kubernetes and OpenShift Container Platform

machine-learning jwt deep-neural-networks ai openshift tensorflow rest-api django-rest-framework swagger drf keras celery network-analysis network-security celery-tasks machine-learning-security ai-security anti-nex

Updated Nov 2, 2018
Python

mitre-atlas / atlas-data

Star

ATLAS tactics, techniques, and case studies data

security machine-learning mitre-attack ai-security mitre-atlas

Updated Apr 22, 2025
Python

Hacking-Notes / VulnScan

Star

Performing website vulnerability scanning usi 10000 ng OpenAI technologie

hacking-tool vulnerability-scanners vulnerability-scanning ai-security chatgpt

Updated Apr 5, 2025
Python

kereva-dev / kereva-scanner

Star

Code scanner to check for issues in prompts and LLM calls

cli security ai linter evaluation code-scanning red-teaming ai-security hallucination ai-evaluation llm prompt-injection llm-security ai-code-review llm-evaluation owasp-llm-top-10 ai-performance ai-red-teaming llm-performance

Updated Apr 6, 2025
Python

zhangzp9970 / MIA

Star

Unofficial pytorch implementation of paper: Model Inversion Attacks that Exploit Confidence Information and Basic Countermeasures

machine-learning research deep-learning ai-security model-inversion-attacks

Updated Mar 31, 2025
Python

LetterLiGo / Inaudible-Adversarial-Perturbation-Vrifle

Star

[NDSS'24] Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time

artificial-intelligence iot-security ai-security

Updated Sep 28, 2024
Python

wssun / TiSE-CodeLM-Security

Star

This repository provide the studies on the security of language models for code (CodeLMs).

security language-model adversarial-attacks ai-security code-intelligence backdoor-attacks adversarial-defense backdoor-defense ai4se lm4se lm4code

Updated Feb 26, 2025
Python

elliothe / CVPR_2019_PNI

Star

pytorch implementation of Parametric Noise Injection for adversarial defense

ai-security adversarial-defense

Updated Oct 23, 2019
Python

HKU-TASR / Imperio

Star

[IJCAI 2024] Imperio is an LLM-powered backdoor attack. It allows the adversary to issue language-guided instructions to control the victim model's prediction for arbitrary targets.

ai-security backdoor-attacks llm

Updated Feb 18, 2025
Python

AI-Initiative-KAUST / VideoRLCS

Star

Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)

reinforcement-learning computer-vision deep-learning explainable-ai ai-security iccv2023

Updated Aug 19, 2023
Python

modzy / sdk-python

Star

Python library for Modzy Machine Learning Operations (MLOps) Platform

python docker machine-learning microservices deployment api-client model-deployment model-serving serving explainable-ai production-machine-learning ai-security mlops kuberenetes drift-detection machine-learning-operations

Updated Sep 8, 2023
Python

Improve this page

Add a description, image, and links to the ai-security topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ai-security topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ai-security

Here are 62 public repositories matching this topic...

Giskard-AI / giskard

utkusen / promptmap

splx-ai / agentic-radar

normster / llm_rules

LetterLiGo / SafeGen_CCS2024

Repello-AI / whistleblower

reds-lab / Narcissus

LLAMATOR-Core / llamator

RjDuan / AdvDrop

jay-johnson / train-ai-with-django-swagger-jwt

mitre-atlas / atlas-data

Hacking-Notes / VulnScan

kereva-dev / kereva-scanner

zhangzp9970 / MIA

LetterLiGo / Inaudible-Adversarial-Perturbation-Vrifle

wssun / TiSE-CodeLM-Security

elliothe / CVPR_2019_PNI

HKU-TASR / Imperio

AI-Initiative-KAUST / VideoRLCS

modzy / sdk-python

Improve this page

Add this topic to your repo