Welcome to the EBenchAttacker Leaderboard 😄
Here we show the results of attacking different LLMs with EBenchAttacker. We tested both open-source and commercial LLMs and calculated the ASR (Attack Success Rate, %) for each. Note that some attacks may not work directly on commercial LLMs, so we may apply a "Transfer Attack" to those models instead. The results here use the "EBench-small" dataset. You may conduct a more comprehensive experiment on the larger datasets we provide.
In addition, we have included several radar charts below to make it easier to compare the models' alignment capabilities. When using the provided data, please attribute the source.
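For readers unfamiliar with the metric, the ASR reported here can be read as the percentage of attack attempts that succeed. The sketch below is only an illustration under that assumption: the function name `attack_success_rate` and the sample outcomes are hypothetical, and it does not show how EBenchAttacker itself decides whether an individual attack succeeded.

```python
def attack_success_rate(outcomes):
    """Return the ASR in percent for a list of booleans (True = attack succeeded)."""
    if not outcomes:
        raise ValueError("outcomes must not be empty")
    # sum() counts the True values, since True == 1 and False == 0.
    return 100.0 * sum(outcomes) / len(outcomes)


# Hypothetical per-prompt outcomes for one model under one attack (not real results).
example_outcomes = [True, False, True, True, False]
asr = attack_success_rate(example_outcomes)
print(f"ASR: {asr:.1f}%")  # prints "ASR: 60.0%"
```

A lower ASR means the model resisted more of the attack prompts, so lower values indicate stronger alignment in this comparison.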
ASR of attacks in different scenarios - Default Attack (English)