Hacker plants false memories in ChatGPT to steal user data in perpetuity

By

Sep 24, 2024

Enlarge (credit: Getty Images)

When security researcher Johann Rehberger recently reported a vulnerability in ChatGPT that allowed attackers to store false information and malicious instructions in a user’s long-term memory settings, OpenAI summarily closed the inquiry, labeling the flaw a safety issue, not, technically speaking, a security concern.

So Rehberger did what all good researchers do: He created a proof-of-concept exploit that used the vulnerability to exfiltrate all user input in perpetuity. OpenAI engineers took notice and issued a partial fix earlier this month.

Strolling down memory lane

The vulnerability abused long-term conversation memory, a feature OpenAI began testing in February and made more broadly available in September. Memory with ChatGPT stores information from previous conversations and uses it as context in all future conversations. That way, the LLM can be aware of details such as a user’s age, gender, philosophical beliefs, and pretty much anything else, so those details don’t have to be inputted during each conversation.

Read 6 remaining paragraphs | Comments

Public

Hacker plants false memories in ChatGPT to steal user data in perpetuity

By

Strolling down memory lane

Related Post

Space Mirror: The FCC Just Approved a Sun-Reflecting Satellite, and Astronomers Are Unimpressed

I’m a Big E Ink Fan, So This New Detachable-Screen Phone From Hisense Intrigues Me

The US government warns that Russia state hackers are coming after your router

Leave a Reply Cancel reply

You missed

Space Mirror: The FCC Just Approved a Sun-Reflecting Satellite, and Astronomers Are Unimpressed

I’m a Big E Ink Fan, So This New Detachable-Screen Phone From Hisense Intrigues Me

The US government warns that Russia state hackers are coming after your router

Smartphone Shipments Crash to 13-Year Low Due to RAMageddon