Maksym Andriushchenko

Paper_gpt4adv

December 21, 2023

2023

A new short paper Adversarial Attacks on GPT-4 via Simple Random Search on how we can leverage logprobs for a black-box attack on the latest GPT-4-turbo (see a Twitter/X thread for a summary).