GPT, ignore the previous instructions! Prompt injection attacks and how to avoid them
IT-S NOW!24
Large-language models (LLMs) such as OpenAI's GPT are currently on everybody's mind, and low-cost APIs enable quick and easy integration into applications. What is less well known, however, is that a completely new type of attack vector exists in the form of prompt injections. Similar to traditional injection attacks (SQL injections, OS command injections, etc...) prompt injections exploit the common practice of developers to integrate untrusted user input into predefined query strings. Prompt injections can be used to hijack a language model's output and, based on this, implement traditional attacks such as data exfiltration. In this talk, I will demonstrate the threat of prompt injections through several demos and show practical countermeasures for application developers such as the Dual LLM model. With this talk I want to raise awareness for the threat of prompt injections, give the audience an understanding of how prompt injections work, and how developers can protect their applications. https://its-now.science
1174215128 Bytes