49 Years of Lyrics: A Python based study of the change in language for popular music from 1970 to 2018.

Let’s take a look at the word breakdowns further.Lyrics (Original Content)[Verse 1] When you're weary Feeling small When tears are in your eyes I will dry them all I'm on your side When times get rough And friends just can't be found Like a bridge over troubled water I will lay me down Like a bridge over troubled water I will lay me down [Verse 2] When you're down and out When you're on the street When evening falls so hard I will comfort you I'll take your part When darkness comes And pain is all around Like a bridge over troubled water I will lay me down Like a bridge over troubled water I will lay me down [Verse 3] Sail on Silver Girl Sail on by Your time has come to shine All your dreams are on their way See how they shine If you need a friend I'm sailing right behind Like a bridge over troubled water I will ease your mind Like a bridge over troubled water I will ease your mind"Corpus (Removed Stop words, Punctuation and made Lowercase)verse 1 when be weary feel small when tear eye I dry I be when time rough and friend not find like bridge troubled water I lay like bridge troubled water I lay verse 2 when be when be street when evening fall hard I comfort I will when darkness come and pain like bridge troubled water I lay like bridge troubled water I lay verse 3 sail silver girl sail Your time come shine all dream way see shine if need friend I be sail right like bridge troubled water I ease mind like bridge troubled water I ease mindAdverbswhen when when just not when down out when when so hard when all around how rightNounsverse tear eye side time friend bridge water bridge water street evening part darkness pain bridge water bridge water time dream way friend bridge water mind bridge water mindVerbsbe feel be will dry be get can be find will lay will lay be be fall will comfort will take come be will lay will lay sail sail have come shine be see shine need be sail will ease will easeWe’re going to map out word frequencies (total and unique), as well average frequency of words that are used across every year to see if we can prove our complexity increase and nouns evolution over the 49 year spread.Average Words and Unique Words Per YearWe can see from the chart above that the amounts of words in each song has been trending upwards from 1970 to 2018, and that generally speaking, unique words tick upwards with the increase in overall number of words..We can also see that the overall number of songs collected doesn’t seem to have a direct effect on either..We can look at this with a stacked barchart as well to see if there are any more insights.This helps us determine that the lowest number of unique words happened in 1978, and also supports the hypothesis that (by measure of uniqueness and word counts) that lyrics have gotten more complex over time..We can also look at these with matplotlib’s subplot feature to overlay multiple dimensions..This will help us visualize if there are any overt correlations.From this view, we can indeed see that unique words and total words follow each other closely, and that the number of songs collected do not appear to have a clear bearing on those values..In fact, when some of the most complex lyrics appear, the collection is actually relatively low..As we’re averaging both word count and unique word count, if there was an outsize problem caused by the data, we would see dips where we saw collection misses.It looks like our most complex year lyric wise was 2004, 2005..Let’s take a look at them below.Most Words, 2004Most Words, 2005We can see here that in both cases the top 5 are Rap/Hip-Hop songs, which makes sense in this case as both of those genres are word heavy vs some of the more Pop songs of the time..You can check the code for more ways to interact with the data, but suffice it to say the results with unique words are similar..I didn’t have the ability to collect genre information with the songs, but I would think you’d see these genres were quite popular in this time frame, which would again support the increase of the word counts.Let’s look at a word cloud or two.I wrote a function that wraps the wordcloud library into a format and font package I like and have pushed some of the years of data through it here..I actually use word clouds a lot in day to day investigations to identify outliers and terms that could bias models that I build..They can also be quite pretty.. More details

Leave a Reply