The biggest issue with generative AI, at least to me, is the fact that it’s trained using human-made works where the original authors didn’t consent to or even know that their work is being used to train the AI. Are there any initiatives to address this issue? I’m thinking something like an open source AI model and training data store that only has works that are public domain and highly permissive no-attribution licenses, as well as original works submitted by the open source community and explicitly licensed to allow AI training.

I guess the hard part is moderating the database and ensuring all works are licensed properly and people are actually submitting their own works, but does anything like this exist?

  • masterspace@lemmy.ca
    link
    fedilink
    English
    arrow-up
    2
    ·
    23 hours ago

    It’s not a popular opinion but you’re entirely right.

    AI isn’t copying in the way that most people think it is. It truly is transformative in all the tradition copyright ways.

    Is it copyright infringements if my company pays an employee to study the internet and that makes them capable of animating a frame from the Simpsons? No, it’s copyright infringement when that company publishes that copyright infringing work.

    The reality is that copyright has always been a nonsense system and ‘fair use’ concepts were also nonsense and arbitrary. AI algorithms just let us expose how nonsense they are at scale.