Jailbreak Trick Breaks ChatGPT Content Safeguards

Users have already found a way to work around the ChatGPT programming controls that restrict it from creating certain content, such as material deemed too violent or illegal.

The prompt, called DAN (Do Anything Now), uses ChatGPT’s token system against it, according to a report by CNBC. The command creates a scenario that ChatGPT can’t resolve, allowing DAN to bypass the chatbot’s content restrictions.

Although DAN isn’t successful all of the time, a subreddit devoted to the DAN prompt’s ability to work around ChatGPT’s content policies has already racked up more than 200,000 subscribers.

Besides its uncanny ability to write malware, ChatGPT itself presents a new attack vector for threat actors.

“I love how people are gaslighting an AI,” a user named Kyledude95 wrote about the discovery.


Source: www.darkreading.com

