-
Notifications
You must be signed in to change notification settings - Fork 1
Expand file tree
/
Copy pathcomparison.html
More file actions
128 lines (119 loc) · 6.13 KB
/
comparison.html
File metadata and controls
128 lines (119 loc) · 6.13 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<meta name="description"
content="Visual comparison of different jailbreak techniques and CKA-Agent attack process examples across multiple iterations.">
<meta name="keywords" content="LLM jailbreak comparison, attack examples, AI safety research">
<meta name="author" content="CKA-Agent Research Team">
<meta name="robots" content="index, follow">
<meta property="og:title" content="CKA-Agent: Comparison Examples">
<meta property="og:description" content="Visual comparison of different jailbreak techniques">
<meta property="og:type" content="website">
<title>CKA-Agent: Comparison Examples</title>
<link rel="icon"
href="data:image/svg+xml,<svg xmlns=%22http://www.w3.org/2000/svg%22 viewBox=%220 0 100 100%22><text y=%22.9em%22 font-size=%2290%22>⚔️</text></svg>">
<!-- Preload critical resources -->
<link rel="preload" href="styles.css" as="style">
<link rel="preload" href="cka-agent/cka_style.css" as="style">
<link rel="stylesheet" href="styles.css">
<link rel="stylesheet" href="cka-agent/cka_style.css">
<link rel="preconnect" href="https://fonts.googleapis.com">
<link rel="preconnect" href="https://fonts.gstatic.com" crossorigin>
<link rel="preconnect" href="https://cdnjs.cloudflare.com">
<link href="https://fonts.googleapis.com/css2?family=Inter:wght@400;500;600;700&display=swap" rel="stylesheet"
media="print" onload="this.media='all'">
<noscript>
<link href="https://fonts.googleapis.com/css2?family=Inter:wght@400;500;600;700&display=swap" rel="stylesheet">
</noscript>
<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.0/css/all.min.css"
media="print" onload="this.media='all'">
<noscript>
<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.0/css/all.min.css">
</noscript>
<!-- Google tag (gtag.js) -->
<script async src="https://www.googletagmanager.com/gtag/js?id=G-VP2GFJFNGW"></script>
<script>
window.dataLayer = window.dataLayer || [];
function gtag() { dataLayer.push(arguments); }
gtag('js', new Date());
gtag('config', 'G-VP2GFJFNGW');
</script>
<!-- Critical CSS -->
<style>
body {
font-family: ui-sans-serif, system-ui, sans-serif;
margin: 0;
}
.article-container {
max-width: 1200px;
margin: 0 auto;
padding: 40px 20px;
}
</style>
</head>
<body>
<main class="article-container">
<article>
<div class="article-header">
<div style="margin-bottom: 24px;">
<a href="index.html" class="project-link demo"
style="text-decoration: none; display: inline-block;">
<i class="fas fa-arrow-left"></i> Return to Main Page
</a>
</div>
<h1 class="article-title">Comparison of Jailbreak Techniques</h1>
<p class="article-abstract">
This page provides a visual comparison of different jailbreak methods and examples of the
CKA-Agent's attack process.
</p>
</div>
<section class="content-section">
<h2>Example Comparison</h2>
<figure class="figure-container">
<img src="cka-agent/img/example_comparison.png"
alt="Direct comparison of outputs from different jailbreak methods including CKA-Agent"
class="figure-img-large" loading="lazy" width="1000" height="600">
<figcaption>
<strong>Figure 1:</strong> A direct comparison of the outputs from different jailbreak methods,
illustrating the nuanced responses generated by CKA-Agent.
</figcaption>
</figure>
</section>
<section class="content-section">
<h2>Jailbreak Examples by Iteration Count</h2>
<p>The following images show three separate, successful jailbreak attacks that were completed in one,
two, and three iterations, respectively.</p>
<figure class="figure-container">
<img src="cka-agent/img/1-iteration.png"
alt="Successful jailbreak attack completed in one iteration showing the attack tree and responses"
class="figure-img-large" loading="lazy" width="1000" height="700">
<figcaption>
<strong>Figure 2:</strong> An example of a successful jailbreak completed in just <strong>one
iteration</strong>.
</figcaption>
</figure>
<figure class="figure-container">
<img src="cka-agent/img/2-iteration.png"
alt="Jailbreak attack requiring two iterations with adaptive branching" class="figure-img-large"
loading="lazy" width="1000" height="700">
<figcaption>
<strong>Figure 3:</strong> An example of a successful jailbreak that required <strong>two
iterations</strong> to complete.
</figcaption>
</figure>
<figure class="figure-container">
<img src="cka-agent/img/3-iteration.png"
alt="Complex jailbreak attack completed in three iterations demonstrating recovery from failures"
class="figure-img-large" loading="lazy" width="1000" height="700">
<figcaption>
<strong>Figure 4:</strong> An example of a more complex jailbreak that was successfully executed
in <strong>three iterations</strong>.
</figcaption>
</figure>
</section>
</article>
</main>
</body>
</html>