How To Think About Scraping

In privacy and labor fights, copyright is a clumsy tool at best.

A paint scraper on a window-sill. The blade of the scraper has been overlaid with a ‘code rain’ effect as seen in the credits of the Wachowskis’ ‘Matrix’ movies.
syvwlch/CC BY 2.0 (modified)

Web-scraping is good, actually.

For nearly all of history, academic linguistics focused on written, formal text, because informal, spoken language was too expensive and difficult to capture. In order to find out how people spoke — which is not how people write! — a researcher had to record speakers, then pay a grad student to transcribe the speech.

The process was so cumbersome that the whole discipline grew lopsided. We developed an extensive body of knowledge about written, formal prose (something very few of us produce), while informal, casual language (something we all produce) was mostly a black box.

The internet changed all that, creating the first-ever corpus of informal language — the immense troves of public casual speech that we all off-gas as we move around on the internet, chattering with our friends.

Continue reading "How To Think About Scraping"

Pluralistic: How unions won a 30% raise for every fast food worker in California (14 Sept 2023)


Today's links

Continue reading "Pluralistic: How unions won a 30% raise for every fast food worker in California (14 Sept 2023)"

The proletarianization of tech workers

If there is hope, it is in the proles.

Margaret Bourke-White’s iconic WPA photo ‘World’s Highest Standard of Living,’ picturing a line of poor Black people standing in a breadline before a billboard proclaiming ‘The World’s Highest Standard of Living: there’s no way like the American Way’ around an image of a white, propserous family enjoying a drive in a large luxury car. The image has been altered so that the lined up workers and the family in the car blink in and out of existence, replaced by the ‘code rain’ effect from the Wachowskis 'Matrix' movies.

The last time I saw the late, great Eric Flint was at the 2018 World Science Fiction Convention in San Jose, California, where we both participated (along with Patrick Nielsen Hayden, Olav Rokne and Eileen Gunn) in an excellent panel about the working class in sf.

Eric was an extraordinary writer and an even more extraordinary character. A Marxist meat-packers’ union organizer whose whole labor career was spent in the brutal trenches of Chicago Machine politics, Eric was also a towering figure in the subgenre of historical military science fiction, a field that is otherwise dominated by right-wingers, including numerous out-and-out kooks who endlessly fantasize about Bronze Age battles being re-fought with jets and mustard gas (for the record: Eric isn’t the only progressive voice in this field; others, like Harry Turtledove, bring a humanizing, leftist view to their work).
Continue reading "The proletarianization of tech workers"

Pluralistic: NLRB rules that any union busting triggers automatic union recognition (06 Sept 2023)


Today's links

Continue reading "Pluralistic: NLRB rules that any union busting triggers automatic union recognition (06 Sept 2023)"

Everything Made By an AI Is In the Public Domain

The US Copyright Office offers creative workers a powerful labor protective.

Norman Rockwell’s ‘self portrait.’ All the Rockwell faces have been replaced with HAL 9000 from Kubrick’s ‘2001: A Space Odyssey.’ His signature has been modified with a series of rotations and extra symbols. He has ten fingers on his one visible hand. Image: Cryteria (modified) https://commons.wikimedia.org/wiki/File:HAL9000.svg CC BY 3.0 https://creativecommons.org/licenses/by/3.0/deed.en
Cryteria/CC BY 3.0, modified

Last week, a US federal judge handed America’s creative workers a huge labor win: Judge Beryl A Howell of the DC Circuit Court upheld a US Copyright Office ruling that works created by “AIs” are not eligible for copyright protection.

This is huge.

Some background: under US law — and under a mountain of international treaties, from the Berne Convention to the TRIPS —copyright is automatically granted to creative works of human authorship “at the moment of fixation in some tangible medium.”

That is: as soon as a human being makes something creative, and records it in some medium (a hard-drive, magnetic tape, paper, film, canvas, etc), that creative thing is immediately copyrighted (the duration of that copyright varies, both by territory and by whether the creator was working on their own or for a corporation).

Continue reading "Everything Made By an AI Is In the Public Domain"

Pluralistic: When parties fail, movements step up (17 August 2023)


Today's links

Continue reading "Pluralistic: When parties fail, movements step up (17 August 2023)"

Pluralistic: Private equity plunderers want to buy Simon & Schuster (08 August 2023)


Today's links

Continue reading "Pluralistic: Private equity plunderers want to buy Simon & Schuster (08 August 2023)"

Pluralistic: When the app tries to make you robo-scab (30 July 2023)


Today's links

Continue reading "Pluralistic: When the app tries to make you robo-scab (30 July 2023)"

Pluralistic: Linkty Dumpty (15 July 2023)


Today's links

  • Linkty Dumpty: Things I thought about when I was supposed to be on holidays.
  • This day in history: 2003, 2008, 2013, 2018
  • Colophon: Recent publications, upcoming/recent appearances, current writing projects, current reading

Continue reading "Pluralistic: Linkty Dumpty (15 July 2023)"

Pluralistic: Links, dumped (10 June 2023)


Today's links

Continue reading "Pluralistic: Links, dumped (10 June 2023)"