Crawling for music part2

5 March 2014
Tags: Podcast , Python

In a previous post (Crawling for music) I tried to automate the process of importing the soundtracks played in the Freakonomics podcast via their tags in the audio transcripts. Some time has passed, new songs have been added and my script required some maintenance. These are the new Python scripts:

freakocrawl
size 5.4 KiB
sha256: d2f9adde3e...bd5a7d476d

spotify_check
size 584.0 bytes
sha256: e71b09fd9c...787732cf6b

raw_music_tags
size 132.4 KiB
sha256: 57f3225cb7...eb6e9256e1

visited
size 16.8 KiB
sha256: 9cbc450b03...12f0500e9c

Removed a lot of unnecessary code and keep track of previously visited pages in order to avoid looking up a page over and over again.

  • Of the 296 unique songs found so far, 140 was found on Spotify (compared to 60 last run) Of the 296 unique songs found so far, 140 was found on Spotify (compared to 60 last run)
  • All tags gathered so far in raw format All tags gathered so far in raw format
  • After a lot of regex manipulation After a lot of regex manipulation
  • On the clipboard after using the ivy service. Use On the clipboard after using the ivy service. Use "spotify_check.py" for finding the difference between two such listings.

    Still using ivyishere.org though. Perhaps I'll look into using the Spotify API directly later...