There are a number of copyright issues relating to both the input and output stages of AI.
At the input stage, there’s the use of copyrighted materials to train LLMs. Does this violate copyright, or is it a transformative fair use (which means it would fall within the fair use exception to copyright law)? If the use of copyrighted materials to train AI does not violate the copyright law, are there still ethical/moral issues with AI using works for training purposes without the permission of the creators?
At the output stage, there are two issues:
- because the AI model is trained on copyrighted material, if it generates output based on that material does the AI-generated material infringe on copyright; and
- what, if any, copyright protection applies to works created by AI -- generally, a human creator is required for copyright to attach. What level of AI assistance is necessary before the work is created (for copyright purposes) by AI instead of a human? What rights do (and should) a creator have over works created with the use, or assistance of, AI?