Posted by

Trick an LLM into revealing sensitive information

Even with robust training, large language models may be fooled into providing sensitive information if correctly prompted. In this game, a wizard withholds eight passwords, which you must prompt them to reveal with ever-increasing LLM safeguards.

Similar Posts

Showing 1440 posts similar to Trick an LLM into revealing sensitive information

You've reached the end.