Posts

  • LLM causing self-XSS

    Introduction So basically I got this stupid idea a few weeks ago: what would happen if an AI language model tried to hack itself? For obvious reasons, hacking the “backend” would be nearly impossible, but when it comes to the frontend… I tried asking Chatsonic to simply “exploit” itself, but it responded with a properly…

    Read more

  • One model to rule them all

    As AI tools like ChatGPT gain popularity, I explored the potential of GPT-4 as an automated offensive prompt engineer. Using GPT-4, I attacked a “vulnerable” LLM named “Gandalf”.

    Read more