Friendica Social Network

Cory Doctorow

3 days ago • •

Cory Doctorow
3 days ago • •

Today's threads (a thread)

Inside: Supreme Court saves artists from AI; and more!

Archived at: pluralistic.net/2026/03/03/its…

#Pluralistic

The Supreme Court building. It has been tinted sepia. Floating in front of it are a 1920s-era Supreme Court, tinted blue-green, their heads replaced with the glaring red eyes of HAL 9000 from Stanley Kubrick's '2001: A Space Odyssey,' and their hands tinted hot pink. They have been distorted with a ripple effect and TV scan lines. The sky is full of dark clouds.

Image:
Cryteria (modified)
https://commons.wikimedia.org/wiki/File:HAL9000.svg

CC BY 3.0
https://creativecommons.org/licenses/by/3.0/deed.en

Pluralistic: Supreme Court saves artists from AI (03 Mar 2026) – Pluralistic: Daily links from Cory Doctorow

^{pluralistic.net}

#Pluralistic

in reply to Cory Doctorow

Tom Marble

in reply to Cory Doctorow • 3 days ago • •

This post by @pluralistic
(Building upon
mamot.fr/@pluralistic/11609535… )

is making me think about what it means to accept genAI code -- esp. in open source where contribution is often governed by an inbound=outbound rule. That means the inbound code must be licensed under the same copyright license as the project .That's not possible if there is no copyright (and neither is assignment under any CLA).

We can infer that any published software (open or proprietary) built with AI is not copyrightable.

@Cory Doctorow

in reply to Cory Doctorow

Cory Doctorow

in reply to Cory Doctorow • 3 days ago • •

Long thread/2

Sensitive content

in reply to Cory Doctorow

Cory Doctorow

in reply to Cory Doctorow • 3 days ago • •

Long thread/3

Sensitive content

in reply to Cory Doctorow

Cory Doctorow

in reply to Cory Doctorow • 3 days ago • •

Long thread/4

Sensitive content

#20yrsago #15yrsago #10yrsago

in reply to Cory Doctorow

Cory Doctorow

in reply to Cory Doctorow • 3 days ago • •

Long thread/5

Sensitive content

#10yrsago

in reply to Cory Doctorow

Cory Doctorow

in reply to Cory Doctorow • 3 days ago • •

Long thread/6

Sensitive content

#10yrsago

in reply to Cory Doctorow

Dylan

in reply to Cory Doctorow • 3 days ago • •

Long thread/6

Sensitive content

Cory Doctorow reshared this.

in reply to Cory Doctorow

uis

in reply to Cory Doctorow • 3 days ago • •

Long thread/2

Sensitive content

Cory Doctorow reshared this.

in reply to Cory Doctorow

uis

in reply to Cory Doctorow • 3 days ago • •

Long thread/2

Sensitive content

in reply to uis

Cory Doctorow

in reply to uis • 3 days ago • •

Long thread/2

Sensitive content

@uis

in reply to Cory Doctorow

Arvīds Kokins

in reply to Cory Doctorow • 3 days ago • •

> [..] all seek to establish that training an AI model is a copyright infringement. This is wrong [..]

This is such a strange hill to die on. Of course "training" infringes - it's compression.

Framing it as "mathematical analysis" or collecting "facts" is disingenuous.

Fourier transform could be easily described the exact same way, except it's fully reversible.

"Facts" wrt copyright are not meant to be the building blocks needed to reconstruct a work to some degree.

in reply to Arvīds Kokins

Cory Doctorow

in reply to Arvīds Kokins • 3 days ago • •

@archo

> "Facts" wrt copyright are not meant to be the building blocks needed to reconstruct a work to some degree.

Literally you could write a book describing each brushstroke needed to reconstruct a copyrighted painting without violating that painting's copyright.

@Arvīds Kokins

in reply to Cory Doctorow

Cory Doctorow

in reply to Cory Doctorow • 3 days ago • •

@archo Further: the cases against models are only tangentially related to memorization. It's undeniable that these suits would exist even if models had "guardrails" that *perfectly* prevented them from reproducing their training data verbatim. Memorization is *not* the crux of any of these complaints - *training* is.

@Arvīds Kokins

in reply to Cory Doctorow

Arvīds Kokins

in reply to Cory Doctorow • 3 days ago • •

Perfect memorization is also irrelevant - lossy compression is also a thing, doesn't stop copyright from applying (otherwise reencoding videos would remove their copyright).

Ultimately though what matters for infringement is whether the original work was substantially used to produce the new work (regardless of the exact process) - and clearly that's the case.

Not even the AI companies dispute infringement, they argue that it's "fair use" instead.

in reply to Arvīds Kokins

Cory Doctorow

in reply to Arvīds Kokins • 3 days ago • •

@archo

> Ultimately though what matters for infringement is whether the original work was substantially used to produce the new work (regardless of the exact process) - and clearly that's the case.

This is completely incorrect.

You could cut a picture into a billion pixels and reconstitute them as a completely different picture without infringing anyone's copyright.

Copyright has nothing to do with whether the constituent components came from a copyrighted work. That's just completely wrong

@Arvīds Kokins

in reply to Cory Doctorow

Cory Doctorow

in reply to Cory Doctorow • 3 days ago • •

@archo Internationally, the limitations and exceptions to copyright are cabined by the Berne "three step" test which does not mention whether the constituent parts are drawn from a copyrighted work.

In the US, there's Fair Use's "four factors," one of which does interrogate the amount of the taking, but is not in itself sufficient for a finding of infringement.

@Arvīds Kokins

in reply to Cory Doctorow

Arvīds Kokins

in reply to Cory Doctorow • 3 days ago • •

In that case you can argue that the particular work was not substantially used in the end result, if all you needed was a pile of pixels (or a palette) without their positional association to each other.

To me, cases like this sit far outside the "substantially used" circle.

AI "models" on the other hand, explicitly consider the positional associations of the pixels of the original images at various scales, not just their color.

in reply to Arvīds Kokins

Cory Doctorow

in reply to Arvīds Kokins • 3 days ago • •

@archo You could produce a painting that consisted of the "average" face by measuring and averaging out the features of every face in every painting extant and it would not infringe copyright.

@Arvīds Kokins

in reply to Cory Doctorow

Arvīds Kokins

in reply to Cory Doctorow • 3 days ago • •

If it's just measurement and not color, features or anything else, then probably yes.

Aggregate functions are naturally destructive - the source data doesn't come through almost at all. Very much unlike autoencoders.

in reply to Cory Doctorow

Arvīds Kokins

in reply to Cory Doctorow • 3 days ago • •

That doesn't make sense. That's what most computer data effectively is - instructions to reconstruct data in a more accessible format. IDK if you're familiar with art tools but some literally encode the brush strokes (like Substance Painter or Clip Studio Paint vector layers) needed to reconstruct the picture.

in reply to Arvīds Kokins

Cory Doctorow

in reply to Arvīds Kokins • 3 days ago • •

Yes, there's some interesting edge-cases implicit in this new, more explicit rule that copyright inheres only for human (and not machine) work.

While it's unlikely a case regarding the copyrightability of a vector layer (or even a clone brush) would reach a judgment (such cases being likely to founder on the *de minimis* standard that excludes trivialities from legal action), such a judgment might assign a "thin" copyright comparable to a "selection" right.

This entry was edited (3 days ago)

in reply to Cory Doctorow

Arvīds Kokins

in reply to Cory Doctorow • 3 days ago • •

Hypothetically, if someone were to extract all the brush strokes from a file, wrote them into a book, retyped that book back into a file and re-rendered them to claim that it's original work, to me that's still a clear-cut infringement, not even de minimis (which could apply if only e.g. less than 0.1% of the brush strokes were used).

At the end of the day, copyright is an economical tool to ensure the author is fairly rewarded for their work, not an obfuscation challenge.

in reply to Arvīds Kokins

Cory Doctorow

in reply to Arvīds Kokins • 3 days ago • •

@archo

> retyped that book back into a file and re-rendered them to claim that it's original work

You are describing an infringing *use*, which does not make the machine itself infringing. This is a bedrock of copyright law, established in 1984 with the Supreme Court's ruling in Sony v Universal (the Betamax decision), which I referenced in my thread: "A device capable of sustaining a substantial noninfringing use is not infringing."

@Arvīds Kokins

in reply to Cory Doctorow

Cory Doctorow

in reply to Cory Doctorow • 3 days ago • •

@archo

> At the end of the day, copyright is an economical tool to ensure the author is fairly rewarded for their work

Categorically untrue. In the US, copyright is one of two elements of the Constitution that contain a justification. The first is the Second Amendment ("a well-regulated militia").

The second is copyright ("To promote the progress of the useful arts and sciences"). What's more, the "progress clause" is a "may," not a "shall," meaning Congress need not provide *any* copyright

@Arvīds Kokins

in reply to Cory Doctorow

Cory Doctorow

in reply to Cory Doctorow • 3 days ago • •

@archo If Copyright is intended to ensure "fair reward," it has failed on its own terms, repeatedly. As I wrote in the thread:

What will the media companies do if they win their lawsuits? Glazier gives us the answer in the opening sentence of his press release: they will create "partnerships" with AI companies to train models on the work we produce.

This is the lesson of the past 40 years of copyright expansion. For 40 years, we have expanded copyright in every way: copyright lasts longer, covers more works, prohibits more uses without licenses, establishes higher penalties, and makes it easier to win those penalties.

Today, the media industry is larger and more profitable than at any time, and the share of those profits that artists take home is smaller than ever.

How has the expansion of copyright led to media companies getting richer and artists getting poorer? That's the question that Rebecca Giblin and I answer in our 2022 book Chokepoint Capitalism. In a nutshell: in a world of five publishers, four studios, three labels, two app companies and one company that controls all ebooks and audiobooks, giving a creative worker more copyright is like giving your bullied kid extra lunch money. It doesn't matter how much lunch money you give that kid – the bullies will take it all, and the kid will go hungry:

@Arvīds Kokins

in reply to Cory Doctorow

Cory Doctorow

in reply to Cory Doctorow • 3 days ago • •

@archo I co-wrote an entire bestselling book of policies that will increase the share of income that goes to creators. None of them involves more alienable exclusive rights:

beacon.org/Chokepoint-Capitali…

Beacon Press: Chokepoint Capitalism

^{www.beacon.org}

@Arvīds Kokins

in reply to Cory Doctorow

Arvīds Kokins

in reply to Cory Doctorow • 3 days ago • •

I don't want to argue in favor of concentration of wealth, power or copyright. :)

Just saying that was the intent of the law. There's definitely plenty of room for improvement and possibly alternatives.

in reply to Cory Doctorow

Arvīds Kokins

in reply to Cory Doctorow • 3 days ago • •

What's untrue?

constitution.congress.gov/brow…

it says:

> "encourage, by *proper premiums* and provisions, the advancement of useful knowledge and discoveries";

in reply to Arvīds Kokins

Cory Doctorow

in reply to Arvīds Kokins • 3 days ago • •

@archo "to promote the useful arts and sciences" - not to "ensure fair reward."

@Arvīds Kokins

in reply to Cory Doctorow

Cory Doctorow

in reply to Cory Doctorow • 3 days ago • •

@archo If the framers had intended this as a compensation scheme, it would have read "shall," not "may" (the majority of the constitution's prescriptions are mandatory, this is one of the few that is optional - that's not an oversight. The framers did not extend *any* copyright to foreign works, for example, because they judged that padding the margins of US printers with the works of foreign authors would promote the useful arts)

@Arvīds Kokins

in reply to Cory Doctorow

Arvīds Kokins

in reply to Cory Doctorow • 3 days ago • •

Obviously they phrased it differently but it's also clear from the associated text that the "promotion" they had in mind was an incentive using some forms of rewards.

The "proper" to me reads as "fair" but I'm sure some may disagree. Fairness isn't entirely objective anyway.

in reply to Cory Doctorow

Arvīds Kokins

in reply to Cory Doctorow • 3 days ago • •

I don't think anyone's arguing in court that a genAI model training script is infringement by itself?

The issue is with the data set the companies collected and fed into the script.

If they had the rights to every single image they used, there would be no lawsuit.

Sorry if I was initially unclear, that was what I was referring to, not the general act of "training".

in reply to Arvīds Kokins

Cory Doctorow

in reply to Arvīds Kokins • 3 days ago • •

@archo

No, you're still wrong.

The act of making a transient copy of a work in order to examine it and publish facts derived from it does not infringe copyright.

You don't need the rights to a work to do these things.

@Arvīds Kokins

in reply to Cory Doctorow

Arvīds Kokins

in reply to Cory Doctorow • 3 days ago • •

This is the same kind of reductive phrasing I already addressed in my initial post.

Not to mention that the copy on their hard drives isn't even especially "transient" considering that they want to reuse the data for subsequent "training" runs.

⇧