Friendica Social Network

Large-scale online deanonymization with LLMs

Paper by,

Simon Lermen, Daniel Paleka, Joshua Swanson, Michael Aerni, Nicholas Carlini, Florian Tramèr

It talks about deanonymizing those who writes under a pseudonym. Sites like reddit, lemmy would be that type.

From the paper,

Given two databases of pseudonymous individuals, each containing unstructured text written by or about that individual, we implement a scalable attack pipeline that uses LLMs to: (1) extract identity-relevant features, (2) search for candidate matches via semantic embeddings, and (3) reason over top candidates to verify matches and reduce false positives.
Our results show that the practical obscurity protecting pseudonymous users online no longer holds and that threat models for online privacy need to be reconsidered.

They can match writing styles, interests, details to infer a job or city, or other unstructured information. That allows to match unrelated pseudonyms to the same person. Like, FooFighterGroupie and Yolanda43905 are the same human, despite they neve

Large-scale online deanonymization with LLMs

We show that large language models can be used to perform at-scale deanonymization. With full Internet access, our agent can re-identify Hacker News users and Anthropic Interviewer participants at high precision, given pseudonymous online profiles an…

^arXiv.org

#privacy

like this

in reply to FineCoatMummy

PolarKraken

in reply to FineCoatMummy • 3 days ago • •

I've been expecting to hear of something like this, it's a natural evolution of LLM use cases and grimly inevitable.

in reply to FineCoatMummy

astraeus

in reply to FineCoatMummy • 3 days ago • •

It’s a damn good thing I’m a gun toting Ohio libertarian that never lies online at all

This entry was edited (3 days ago)

in reply to astraeus

grey_maniac

in reply to astraeus • 3 days ago • •

Definitely! I recall seeing you at the Lodge meetings.

in reply to grey_maniac

astraeus

in reply to grey_maniac • 3 days ago • •

We should go to the range sometime to get away from those dang liberals😎

in reply to FineCoatMummy

corvus

in reply to FineCoatMummy • 3 days ago • •

So it seems that letting LLMs to write sloppy posts for us can be useful after all. May be c/privacy should implement an automatic AI reformating XD

This entry was edited (3 days ago)

in reply to corvus

FineCoatMummy

in reply to corvus • 3 days ago • •

Yah, there might be something to that. For protection against style + vocab matching.

It sucks though. I recently read where the more people use LLM assisters when they write, the more the whole virtual commons grows bland. It feeds back upon itself.

Sigh. I just want a world where we can have nice things. And assholes don't try to ruin the nice things we could have.

in reply to corvus

astraeus

in reply to corvus • 3 days ago • •

You’re absolutely right! It’s not just subterfuge—it’s praxis.

like this

in reply to corvus

Nils

in reply to corvus • 3 days ago • •

Previously, the advice was to translate your posts into one or two languages before posting. It seems that even rough content generated by large language models (LLMs) can help people fit in more easily.

I like how slop became "rough content" after translation.