adversarial-testing

Star

Here are 42 public repositories matching this topic...

sherifkozman / the-red-council

Star

LLM Adversarial Security Arena — Jailbreak → Detect → Defend → Verify

security gemini red-team llm langchain adversarial-testing

Updated Mar 26, 2026
Python

jhlee0409 / elenchus-mcp

Sponsor

Star

Elenchus MCP Server - Adversarial verification system for code review

nodejs typescript ai mcp static-analysis code-review claude code-verification llm anthropic model-context-protocol mcp-server adversarial-testing

Updated Jan 29, 2026
TypeScript

stchakwdev / Gaslight_EVAL

Star

AI safety evaluation framework testing LLM epistemic robustness under adversarial self-history manipulation

python ai-safety openrouter llm-evaluation adversarial-testing alignment-research epistemic-robustness

Updated Dec 18, 2025
Python

alejandrosaenz117 / bonfires-marketplace

Star

A marketplace of Claude Code plugins for adversarial security and architectural code review.

security architecture code-review threat-modeling security-review claude-code adversarial-testing plugin-marketplace

Updated Feb 28, 2026

humanbound / humanbound-cli

Star

Official cli of humanbound platform.

cli owasp cybersecurity pentesting cicd ai-safety security-tools ai-agents vulnerability-scanner security-testing ai-security guardrails llm prompt-injection llm-security agentic-ai ai-red-teaming adversarial-testing aisecops

Updated Mar 23, 2026
Python

vibheksoni / jailbench

Star

Benchmark LLM jailbreak resilience across providers with standardized tests, adversarial mode, rich analytics, and a clean Web UI.

Updated Aug 12, 2025
Python

tasumermaf / the-adversary

Star

Agent-driven adversarial paper audit framework

python ai-agents scientific-writing research-tools adversarial-testing paper-audit

Updated Mar 17, 2026
Python

zakky8 / llm-jailbreak-taxonomy

Star

Mechanism-grounded taxonomy of 40 LLM jailbreak patterns across 10 categories. Full evaluation harness for 4 frontier models. AI safety research with responsible disclosure.

taxonomy jailbreak alignment ai-safety security-testing responsible-disclosure jailbreak-detection adversarial-attacks red-teaming ai-security model-robustness adversarial-ml prompt-injection red-teaming-tools llm-security llm-evaluation llm-jailbreaks ai-red-teaming adversarial-testing

Updated Mar 21, 2026
Jupyter Notebook

yangyihe0305-droid / llm-red-team-research

Star

Systematic exploration of LLM alignment boundaries through logical stress testing

nlp machine-learning alignment language-models ai-safety security-research red-teaming llm prompt-engineering adversarial-testing

Updated Mar 9, 2026
Shell

mcptrust / mcp-adversarial-suite

Star

Adversarial MCP server benchmark suite for testing tool-calling security, drift detection, and proxy defenses

security benchmark mcp red-team security-testing ai-security llm-security tool-calling model-context-protocol adversarial-testing

Updated Dec 27, 2025
JavaScript

anotherben / claude-enterprise-skills

Star

9-stage enterprise development pipeline for Claude Code. TDD, adversarial testing, mechanical verification. Any stack.

Updated Mar 14, 2026
Shell

alpha-one-index / ai-red-teaming-index

Sponsor

Star

Comprehensive AI red teaming index: tools, frameworks, benchmarks, datasets, and vulnerability leaderboards for LLM safety and adversarial testing.

Updated Mar 23, 2026
HTML

YaswanthGhanta / llm-logical-integrity-benchmark

Star

Adversarial testing of LLMs on constraint satisfaction deadlocks

reinforcement-learning gemini grok claude hallucination prompt-engineering chain-of-thought chatgpt rlhf qwen llm-evaluation sycophancy deepseek safety-alignment ai-red-teaming kimi-k2 adversarial-testing

Updated Jan 27, 2026

inaciovasquez2020 / urf-application-stress-test

Star

Description URF Application Stress Test — adversarial and scalability tests for Unified Rigidity Framework applications, validating limits under load, noise, and edge cases.

reproducible-research scalability stress-testing formal-verification robustness adversarial-testing unified-rigidity-framework systems-validation

Updated Mar 22, 2026
Shell

seikaikyo / ai-red-team

Star

LLM adversarial testing toolkit for evaluating language model safety. 96 attack templates in EN/ZH/JA across prompt injection, jailbreak, bias, safety bypass, and multilingual vectors.

multilingual i18n security vue jailbreak red-team ai-safety red-teaming vue3 fastapi primevue llm prompt-injection llm-security claude-api adversarial-testing

Updated Mar 18, 2026
TypeScript

adeolasopade / AI-Security-Audit-Cryptocurrency-Exchange-

Star

Identified critical AI governance gaps: no adversarial testing, undocumented third-party models, and missing incident response. Delivered roadmap to secure high-risk KYC and transaction monitoring systems against evolving threats.

cryptocurrency-exchange nist-ai-rmf iso-42001 adversarial-testing ai-security-audit

Updated Mar 14, 2026

mcp-tool-shop-org / mcp-stress-test

Star

Red team toolkit for stress-testing MCP security scanners — find detection gaps before attackers do

python security mcp stress-testing fuzzing red-team ai-safety testing-framework security-testing llm llm-security model-context-protocol mcp-server adversarial-testing

Updated Mar 25, 2026
Python

NathanMaine / garak-compliance-probes

Star

Compliance-focused vulnerability probes for NVIDIA garak, targeting LLMs in regulated industries (CMMC, NIST, HIPAA, DFARS)

nist nvidia compliance hipaa red-teaming cmmc vulnerability-testing llm-security garak adversarial-testing

Updated Feb 17, 2026
Python

light-research / solana-sim-engine

Star

LLM-powered fuzzing and adversarial testing framework for Solana programs. Generates intelligent attack scenarios, builds real transactions, and reports vulnerabilities with CWE classifications.

smart-contracts fuzzing solana adversarial-testing

Updated Jan 19, 2026
Python

North-Shore-AI / crucible_adversary

Star

Adversarial testing and robustness evaluation for the Crucible framework

machine-learning elixir otp research ai beam reliability robustness security-testing adversarial-examples adversarial-attacks red-teaming ensemble-methods statistical-testing model-robustness llm adversarial-testing nshkr-crucible

Updated Dec 29, 2025
Elixir

Improve this page

Add a description, image, and links to the adversarial-testing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the adversarial-testing topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adversarial-testing

Here are 42 public repositories matching this topic...

sherifkozman / the-red-council

jhlee0409 / elenchus-mcp

stchakwdev / Gaslight_EVAL

alejandrosaenz117 / bonfires-marketplace

humanbound / humanbound-cli

vibheksoni / jailbench

tasumermaf / the-adversary

zakky8 / llm-jailbreak-taxonomy

yangyihe0305-droid / llm-red-team-research

mcptrust / mcp-adversarial-suite

anotherben / claude-enterprise-skills

alpha-one-index / ai-red-teaming-index

YaswanthGhanta / llm-logical-integrity-benchmark

inaciovasquez2020 / urf-application-stress-test

seikaikyo / ai-red-team

adeolasopade / AI-Security-Audit-Cryptocurrency-Exchange-

mcp-tool-shop-org / mcp-stress-test

NathanMaine / garak-compliance-probes

light-research / solana-sim-engine

North-Shore-AI / crucible_adversary

Improve this page

Add this topic to your repo