This workshop is part of EARL week! We will be hosting four workshops from the 6th-9th September and the week will come to a conclusion on Friday 10th September, with a day full of presentations on using R in enterprise. We hope you can join us, if you'd like to view the other EARL events please see the 'also by Mango' box to the right of this text. EARL is proudly organised by Mango Solutions.
Web Scraping and Text Mining Lyrics in R
Date and time: Thursday 9th September 2021, 2pm-5pm BST
Level – Intermediate
Harnessing the wealth of freely available information on the Internet, Data Scientists can generate their own datasets on virtually any topic of interest. In this workshop, we demonstrate a full text analysis workflow in R using lyrics from currently popular songs. After scraping our lyrics from the web, we illustrate the steps typically involved in analysing text, from its cleaning and pre-processing to a variety of widely used feature engineering, visualisation and modelling techniques.
Good working knowledge of R programming.
A basic understanding of regular expressions.
Registrants will be provided with a list of R packages to install prior to the workshop.
Cleaning and pre-processing text data
Profits from EARL 2021 will be donated to Data Kind UK.
Daniel is a Data Scientist at Mango, where he spends his time either consulting on a variety of projects or delivering training that hopefully spreads the love for R. While he thoroughly enjoyed his previous work as a researcher in Computational...
Andrew joined Mango straight out of Bristol University, as part of the first full graduate programme in the company. Since then, he has taken part in multiple solo and team projects, with a focus on statistical analysis and modelling - in line...