# The Likeability index: a method for classifying likeability of YouTube videos

I have had a YouTube channel for about 5 years.  I have a few hundred subscribers, and a few dozen of them regularly watch my videos and comment.  I think I have had around 30,000 views a year.  All in all, my contribution is a mere speck on the sandy beaches of YouTube content.

What makes a YouTube video popular?

Partly for narcissistic reasons, and partly out of curiosity, I want to know what it is that people like and don’t like on YouTube.  I find it fascinating that a video like “Charlie bit my finger” gets thumbs down, for example.  What could possibly motivate someone to dislike this video?  Are they just internet trolls?  Are they child haters?  Did they accidentally click the wrong thumb sign?

The problem is that while views, likes and dislikes are all important in determining what people think about a YouTube video, it is hard to interpret them together to get an overall sense of likeability.  What is needed is an index that combines this information into a single, sensible metric. Sounds like a job for someone with some free time on a Sunday afternoon!

A likeability index!

To start, I had to sample videos.  I know of no way to select a true random sample of YouTube videos, so instead I created a list of random words (found here) that I then used as search terms on YouTube, and then picked the video at the top of the results page associated with each search term. For each of these videos (N=125) I collected date of upload, number of visits, number of likes and number of dislikes.  Four of these videos had no information on likes and dislikes, so I dropped them.  This left me with 121 videos.

Then I came up with an index for ranking the likeability as follows.  First, I calculated the total views, likes and dislikes per day based on the date I extracted the data and the date the video was uploaded.  I refer to these are TPD, LPD and DPD, respectively.  Then I calculated two ratios

with the following reasoning.  I assume that if a video is ‘perfectly’ liked, then every visit would correspond with a like, and TPD would equal LPD.  Similarly, if a video is ‘perfectly’ disliked, then every visit would correspond with a dislike (TPD would equal DPD).  To combine these two metrics into a single index I could just subtract one from the other, but this would suggest that likes and dislikes are of equal importance when judging the relative likeability of videos.  As it turns out, the ratio of likes to dislikes on the sample of videos I took is about 20 to 1.  I take this to mean that dislikes matter considerably less than likes when it comes to the judgement of a YouTube video’s overall likeability.  So I combine these two ratios into a single index

that can be easily modified with a different constant if so desired.  It’s worth noting (I realized this after the original post) that the formula can be simplified algebraically to remove the day term.  This leaves us with

There are important shortcomings worth noting.  This likeability index does not measure popularity, which is probably more important from a  marketing standpoint.  Furthermore, it is probably not constant over time; when a video is posted, the subscribers to the channel will probably give it a short term boost in likeability index, since they probably have a more positive disposition towards the channel than the average viewer.  This figure shows the relationship between the likeability index and the number of days since upload in my sample of videos from YouTube.

As you can see, the longer a video is on YouTube, the lower its likeability rating.  I suspect this has to do with the flow and ebb of popular culture; interest in a video starts out with enthusiasm, which gradually gives way to apathy over time.  So to make a proper comparison of likeability, one should probably compare videos uploaded at a similar time–say, the same year.

Here is a graph of my most popular video (about 16,000 views), with the likeability index on the Y-axis, and date on the x-axis.

As you can see, the likeability index for this video stabilizes at around 0,00225, which I would interpret as the ‘natural’ or true likeability of that video.  The early variations reflect clusters of likes I probably got from subscribers in the past.  How does this compare with the likeability of videos in my sample?  Well, the median likeability score in my sample is about 0.005, which means that I seem to have a poor likeability score on my video. However, the median likeability index for the year my video was uploaded is 0.003, which puts me in more middling territory.  Thank goodness!

My conclusions

There is no obvious benchmark for evaluating this likeability index, however I think it is a nice tool for systematically assessing the likeability of a video on YouTube