Since Reddit content being used to train AI was part of what triggered their Dumb Actions™️, is there a way to deal with this on Lemmy? If there’s a way to license API access or the content itself under, say, LGPL to prevent commercial AI from using it that would be awesome. With the way ActivityPub works I’m not sure if that’s possible though.
The licensing doesn’t matter, most AI are trained off proprietary and copyrighted data. There’s still a lot of talks in governments about whether this is legal or not, but at this point the cat’s out of the bag and I doubt we’ll regress back to using smaller amounts of data.
In Europe they are currently trying to publish a law that sources have to be given by AI if the result is based on proprietary source material. See https://www.reuters.com/technology/eu-lawmakers-committee-reaches-deal-artificial-intelligence-act-2023-04-27/
Aside from the fact that I don’t think this law will pass, I doubt it’ll be effective at all. Companies will just move AI training to countries where it is legal. The most the EU can do right now is play whack-a-mole and start blocking AIs that don’t meet its requirements, but at that point people will just host mirrors or use a VPN. It’s just not enforceable, and the EU knows that, which is why they’re so stressed out trying to figure out a reasonable law regarding AI.
Yeah I think so too.