Science & Technology

Posted by

Apr 26, 2025

Trick an LLM into revealing sensitive information

Even with robust training, large language models may be fooled into providing sensitive information if correctly prompted. In this game, a wizard withholds eight passwords, which you must prompt them to reveal with ever-increasing LLM safeguards.

Gandalf | Lakera – Test your prompting skills to make Gandalf reveal secret information.

https://gandalf.lakera.ai/baseline

Similar Posts

Showing 1440 posts similar to “Trick an LLM into revealing sensitive information”

You've reached the end.