music library

I recently undertook the task of attempting to listen to my entire music collection. I have a lot of music, so this took me the better part of a year (just under 250 days, if I remember right. I have a lot of music. It’s a bit of a problem.) I opted to do this by setting my music collection to play in alphabetical order by title, starting with a random selection.

Truth be told, I did a lot of skipping tracks. You’d be surprised how many movie and video game soundtracks are mostly filler.

I began and ended on the letter E. I had been noticing throughout the experience that some letters contained a lot more music than others, which is to be expected; the usual Wheel of Fortune suspects R, S, T, L, and N all seemed to be well-represented, and I was expecting E to be a triumphant, long home stretch. It seemed to be over much quicker than I expected, though. So this got me thinking what the actual alphabetical distribution of my music collection was.

I put together a Python script to evaluate it for me. Here was the result.



This came as a bit of a surprise. I had no idea so many tracks would be distributed into S and T. Then I remembered that I was not ignoring the word “The” at the start of a title. Adjusting for that produced the following:



I want to say this makes more sense, but I’m not sure what I was expecting or not expecting. This does confirm my suspicion that the letter E (the most common vowel in English) was underrepresented. It’s not the lowest ranking letter, but lower than I would expect.

My music collection is probably an outlier, though. Most people do not have one Final Fantasy soundtrack in their collection, let alone seven. I’d like to release the script publicly and get some anecdata to publish, but it currently doesn’t work in Windows. When the remaining issues are sorted out, I’ll update this post.

I’ll also probably use a less awful UI toolkit than Tk.