Anna’s Archive, the open-source search engine for shadow libraries, says it scraped Spotify’s entire library of music. The group acquired metadata for around 256 million tracks, with 86 million actual songs, and the collection is just under 300TB in total size.
“A while ago, we discovered a way to scrape Spotify at scale. We saw a role for us here to build a music archive primarily aimed at preservation,” the group said. The pirated treasure trove of music represents over 15 million artists with over 58 million albums.
The group intends to make all files available for download to anyone with enough disk space. “This Spotify scrape is our humble attempt to start such a ‘preservation archive’ for music. Of course Spotify doesn’t have all the music in the world, but it’s a great start,” the group wrote. The 86 million songs that the group has archived so far represent about 99.6 percent of listens on the platform. However, those songs make up only about 37 percent of the total, and the group still has millions left to archive.
The open-source site is normally focused on text like books and papers, which it says offers the highest information density. The group says its goal of “preserving humanity’s knowledge and culture” doesn’t distinguish between media types. Of course, none of this is exactly legal, and sharing or downloading these files is flagrantly in violation of IP protection laws.
Anna’s Archive contends that current collections of music, both physical and digital, are over-indexed to the most popular artists or composed of unnecessarily large files because of collectors’ focus on fidelity. The group says that what it has amassed is by far the largest music metadata database publicly available. The music files will be released in stages, in order of popularity.


