R bloggers

The Bold & Beautiful Character Similarities using Word Embeddings

FavoriteLoadingAdd to favorites

Introduction I often see advertisement for The Bold and The Beautiful, I have never watched a single episode of the series. Still, even as a data scientist you might be wondering how these beautiful ladies and gentlemen from the show are related to each other. I do not have the time to watch all these episodes to find out, so I am going to use word embeddings on recaps instead… Calculating word embeddings First, we need some data, from the first few google hits I got to the site soap central. Recaps can be found from the show that date back to 1997. Then, I used a little bit of rvest code to scrape the daily recaps into an R data set. Word embedding is a technique to transform a word onto a vector of numbers, there are several approaches…
Original Post: The Bold & Beautiful Character Similarities using Word Embeddings