

Download all existing literature to build a library for preservation and you’re called a pirate.
Said library contains petabytes of the exact text of each and every piece of literature.
Download all existing literature from aforementioned library to train an LLM and you’re a tech innovator.
Said model contains gigabytes of a bunch of weights that can never go back to the exact words of the book.
What a strange world we live in.
It’s not strange at all. It’s degrees of compression. You compress a JPEG to the point that it’s unrecognizable, and it’s no longer breaking copyright. It’s essentially like trying to write a book you just read based on memory.





That’s my point. If it’s invisible, it’s done its job.