The Daniel Stih Podcast
AI companies have been accused of training music-generation models on copyrighted songs without permission. Lawsuits followed. Licensing deals emerged. The debate became about copyright and compensation. While investigating the issue, I found myself asking a different question: How did the music actually get into the training system? That question led me into datasets, metadata, YouTube links, and an under explored part of the public discussion—the pipeline between publicly available music and AI model training. In this episode, I explore why datasets are not the same as audio collections and why understanding how a system works is as important as deciding what should happen after the fact. This isn't an argument for or against AI companies or artists. It's an exploration of problem definition, assumptions, and why understanding the mechanism leads to better questions—and better solutions.
209 Folgen
Kommentare
0Sei die erste Person, die kommentiert
Melde dich jetzt an und werde Teil der The Daniel Stih Podcast-Community!