all about machine learning and keras on sustained model training which identifies the original images traits from the duplicate one to prove the image is subjected to copyright or may be a media based on violation of the policy
if you want to use the dataset use the .json file #do not violate