Which of the following statements best describes the copyright status of content generated entirely by an AI without human creative input?
Think about who can hold copyright and if a machine can be an author.
Copyright law generally requires human authorship. Content created solely by AI without human creativity is usually considered public domain.
Which practice is most likely to violate copyright laws when training an AI model?
Consider what happens if you use someone else's work without permission.
Using copyrighted material without permission for training can infringe copyright, unless fair use or licenses apply.
You have a model trained on mixed data, some copyrighted and some public domain. Which metric best helps you measure if the model is generating content that might infringe copyright?
Think about how to detect if output copies training data.
Similarity scores can detect if generated content closely matches copyrighted training data, indicating possible infringement.
Given a text generation model, which scenario below indicates a potential intellectual property risk?
Look for exact copying of protected content.
Output identical to copyrighted text can infringe intellectual property rights.
Which model architecture is best suited to reduce the risk of memorizing and reproducing copyrighted training data?
Consider dataset quality and model size impact on memorization.
Smaller models trained on licensed, curated data with augmentation reduce memorization and copyright risks.