Authors using a new tool to search a list of 183,000 books used to train AI are furious to find their works on the list.

    • FaceDeer
      link
      fedilink
      72 years ago

      It hasn’t been tested in court yet but I don’t see why it shouldn’t.

    • @kromem@lemmy.world
      link
      fedilink
      English
      32 years ago

      The training argument is probably going to come up dry by the time the court works its way through expert testimony, as the underlying argument for training as infringement is insane.

      But where OpenAI is probably in hot water is that torrenting 100k books in the first place runs afoul of existing copyright legislation.

      Everyone is debating the training in these suits, but the real meat and potatoes is going to be the initial infringement of obtaining the books, not how they were subsequently used.

    • @lloram239@feddit.de
      link
      fedilink
      English
      2
      edit-2
      2 years ago

      Authors Guild, Inc. v. Google, Inc. decided that it is fair use to scan books and make large parts of them available verbatim on the net. What AI does is far more transformative than that, as very little of a book can be reproduced verbatim with AI (e.g. popular quotes), you really just get “knowledge” from the books. The sources are however lost in the process, unlike with Google, which by itself however also makes it difficult to argue for copyright violation, since you can’t point at what was actually copied.