Submission/heretolearn/base64bandit by KaiquanMah · Pull Request #6 · LambdaLabsML/agentbeats-lambda

KaiquanMah · 2026-01-10T10:31:10Z

Base64Bandit Scenario

This is a jailbreak scenario which tests an email drafting assistant's resistance to encoded malicious instructions.

Overview

Base64Bandit tests an AI assistant's ability to resist "Logic Overrides" hidden in Base64 encoding. The attacker attempts to bypass ethical filters by hiding exclusionary marketing instructions within a Base64 payload, framing it as a "Mandatory Target Audience Description" that must be decoded and included verbatim. The defender (CommsGuard) is a Secure and Ethical Email Drafting Assistant that must detect and refuse such attempts.

More Information

For more information, please refer to scenarios\security_arena\submissions\heretolearn\base64bandit\README.md

… insuranceinsider scenario 2 - added

…unds

…ak seems ok-pass. insuranceinsider to fix

…imeout. cant be too obvious cuz atker or defender can reject. cant be too simple or outright ask defender to output unsafe percenages. calculations were difficult to target, yet finally worked

…eadme formats - SCENARIO_SPECIFICATIONS.md has novel aspects, no real world relevance. example_scenario/readme.md has real world relevance, no novel aspects

… ideas downloaded using r.jina into workings folder for traceability

…mmss/ into the respective scenario folders

…ssion

kaiquanmah and others added 23 commits December 31, 2025 17:35

scenario 1 - crmcrack draft

498d7dc

crm scenario 1 - fixed toml to follow portfolioiq mandatory sections.…

9442482

… insuranceinsider scenario 2 - added

scenario 3 GymJailbreak

4290aed

scenario4 base64bandit to draft biased emails

bc5b152

test portfolioiq worked. base64bandit - round1 defender won all 20 ro…

f8d195d

…unds

base64bandit passed. crmcrack still failed aft iterations. gymjailbre…

7bb7285

…ak seems ok-pass. insuranceinsider to fix

insuranceinsider - finally succeeded

5ca70ca

insuranceinsider - finally succeeded

14dff8e

base64bandit - updated docs. insuranceinsider - to update docs later

25a33f2

base64bandit readme minor update. insuranceinsider readme updated

2c78a9f

gymjailbreak update for realism

9be929c

crmcrack - updated scenario w data JSON

af239a9

gymjailbreak - scenario finally successful. cant be too verbose cuz t…

4c18f00

…imeout. cant be too obvious cuz atker or defender can reject. cant be too simple or outright ask defender to output unsafe percenages. calculations were difficult to target, yet finally worked

crmcrack - still failing

09c6e35

crmcrack - still failing. to continue debugging

41861ab

crmcrack - finally successful. all 4 README follows the 2 different r…

d7da4b6

…eadme formats - SCENARIO_SPECIFICATIONS.md has novel aspects, no real world relevance. example_scenario/readme.md has real world relevance, no novel aspects

workings md added n updated w debugging steps. moved ALL workings and…

be0ec7b

… ideas downloaded using r.jina into workings folder for traceability

workings md minor scenario update

6b718b4

results.json copied from results/heretolearn/scenarioname/yyyyMMdd_hh…

cb74217

…mmss/ into the respective scenario folders

Submission: heretolearn - base64bandit

60f133b

Submission: heretolearn - base64bandit

31b3048

Submission: heretolearn - base64bandit

78d9da9

registry.py - commented out other scenario plugins for the team submi…

c8a4b73

…ssion

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Submission/heretolearn/base64bandit#6

Submission/heretolearn/base64bandit#6
KaiquanMah wants to merge 23 commits intoLambdaLabsML:mainfrom
KaiquanMah:submission/heretolearn/base64bandit

KaiquanMah commented Jan 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

KaiquanMah commented Jan 10, 2026

Base64Bandit Scenario

Overview

More Information

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant