The purpose of this repository is to analyze a news popularity dataset, producing multiple files reporting analyses for each of the six different data channels: lifestyle, entertainment, social media, business, tech, and world. Each file goes through reading in the dataset, data manipulation and variable creation, summary table creation, data visualization, and model fitting using linear regression, random forest, and boosted tree methods. At the end of each document, a “best model” is declared.
tidyversecaretggplot2The code used to render the documents is as follows:
for(i in c("Lifestyle","Entertainment","Business","Social Media","Tech","World")){
rmarkdown::render("Project2.Rmd",output_file=i,params = list("channel"= i))
}