Psychological Tricks Can Get AI to Break the Rules

Researchers convinced large language model chatbots to comply with “forbidden” requests using a variety of conversational tactics.
