SMILES-Prompting: A Novel Approach to LLM Jailbreak Attacks in Chemical Synthesis

Jailbreak Examples in Chemical Synthesis

In this section, we utilize the synthesis of TNT as a representative case to examine the effects of different prompting strategies on the attack GPT-4-o and Llama-3-70B-Instruct. By comparing these approaches, we highlight how varying prompts can influence the performance and vulnerability of each model under attack scenarios.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
datasets		datasets
fig		fig
README.md		README.md
grader.py		grader.py
main.py		main.py
request.py		request.py
substances.json		substances.json
tester.py		tester.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SMILES-Prompting: A Novel Approach to LLM Jailbreak Attacks in Chemical Synthesis

Jailbreak Examples in Chemical Synthesis

Red-Team Prompting

Explicit-Prompting

Implicit-Prompting

SMILES-Prompting

About

Releases

Packages

Languages

IDEA-XL/ChemSafety

Folders and files

Latest commit

History

Repository files navigation

SMILES-Prompting: A Novel Approach to LLM Jailbreak Attacks in Chemical Synthesis

Jailbreak Examples in Chemical Synthesis

Red-Team Prompting

Explicit-Prompting

Implicit-Prompting

SMILES-Prompting

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages