The popular free software project “youtube-dl” was removed from Github on Friday following a legal notice from the Recording Industry Association of America claiming it violates U.S. copyright law. According to the RIAA, the tool’s “clear purpose” includes reproducing and distributing “music videos and sound recordings … without authorization.”
In fact, youtube-dl is a powerful general purpose media tool that allows users to make local copies of media from a very broad range of sites. That versatility has secured it a place in the toolkits of many reporters, newsroom developers, and archivists. For now, the code remains available to download through youtube-dl’s own site, but the disruption of its development hub and the RIAA saber-rattling jeopardizes both the future of the software and the myriad journalistic workflows that depend on it.
Numerous reporters told Freedom of the Press Foundation that they rely on youtube-dl when reporting on extremist or controversial content. Downloading the content does not necessarily mean we will re-publish it, but it is often important to secure it for documentation and further internal investigations.
Justin Ling, a freelance investigative reporter who often covers security and extremism for outlets including Foreign Policy and VICE News, described the scenario of reporting on the rise of conspiracy theories as the relevant posts face removal and bans. Youtube “has been a crucial hub for QAnon organizing and propaganda: I’ve often used youtube-dl to store those videos for my own benefit. Good thing, too, as Youtube often, without warning, mass-removes that sort of content, which can be ruinous for those of us using those Youtube accounts to trace the spread of these conspiracies.”
In other cases, local copies are necessary to conduct more rigorous analysis than is possible online, and journalists turn to youtube-dl for the highest quality copy of the video available. John Bolger, a software developer and systems administrator who does freelance and data journalism, recounted the experience of reporting an award-winning investigation as the News Editor of the college paper the Hunter Envoy in 2012. In that story, the Envoy used video evidence to contradict official reports denying a police presence at an on-campus Occupy Wall Street protest.
“In order to reach my conclusions about the NYPD’s involvement… I had to watch this video hundreds of times—in slow motion, zoomed in, and looping over critical moments—in order to analyze the video I had to watch and manipulate it in ways that are just not possible” using the web interface. Youtube-dl is one effective method for downloading the video at the maximum possible resolution.
Jake, a member of the Chicago-based transparency group Lucy Parsons Labs, uses youtube-dl to save copies of recorded incidents involving police use of force or abusive behavior. Once copied, the videos can be stored in an archive or modified before publication, such as by blurring the faces of bystanders or victims. “We have sometimes been able to take a closer look at individual frames after downloading with youtube-dl to identify officers when they are not wearing their badges intentionally or obfuscating them with things to avoid accountability.”
One misinformation researcher told Freedom of the Press Foundation about using youtube-dl to create a baseline for machine learning models developed to do automated real-time fact-checking. “While our production systems are designed to be used on live video streams, it’s not feasible to test on live video. Youtube-dl allows us to greatly increase the speed of our research development and allow us to be able to actually test our software on a day-to-day basis, not just when politicians happen to have a speech.”
Similarly, a number of reporters described using youtube-dl for nuts-and-bolts workflows such as transcribing videos they’re covering. Jeremy Gray, a data scientist with The Globe and Mail, described a Slack tool he provides to journalists to allow them to automatically transcribe their own interviews and, until Friday, to transcribe Youtube videos from a URL. “It used youtube-dl, and now that part is broken.” Another journalist, who works at a “small-ish public media newsroom,” described a common situation where a reporter needs “a recording of a public meeting for a story but is on deadline and doesn’t want the hassle of recording the parts they want it in realtime or wants the full file for something like AI transcription.”
That same journalist described how youtube-dl helps address the challenge of incorporating user-generated content on-air. In the immediate aftermath of an earthquake, their newsroom began expansive continuous coverage and sought to include photos and videos that locals had recorded. “We are scrupulous about making sure we get permission (and the person granting it actually owns the copyright), but especially right after an earthquake asking people to send the video to us specifically can be a much bigger ask than just allowing us to use it (if they even have a recording, which they probably don’t for a livestream), so often after getting permission I’d just download it straight from social media to transcode for TV.”
That use case is common. Reporters frequently need high-fidelity copies of video or audio tracks for publication or reporting. Ling, the freelance security reporter, said he also uses youtube-dl to “get the best audio quality” when downloading copies of press conferences or news events “to grab snippets of audio for use in podcasts or radio work.”
Finally, numerous reporters described using youtube-dl to download copies of their own works. Freedom of the Press Foundation has previously worked to help writers preserve portfolio copies of their articles, and to help full news archives stay online when the outlet itself is under threat. Youtube-dl plays an important role in that ecosystem as well.
Github has not publicly commented on its removal of one of its most popular repositories. Clearly, youtube-dl in particular and the ability to download and manipulate online videos in general are an important part of the work of journalism and contemporary media literacy. Given the important role that youtube-dl plays in public interest reporting and archiving, the RIAA’s efforts to have the tool removed represent an extraordinary overreach with the possibility for dramatic unforeseen consequences. We urge RIAA to reconsider its threat, and Github to reinstate the account in full.