Ravelry is one of the largest online fiber works sites available. I use the platform to find new and exciting patterns to knit for myself and my friends and family. In October of 2022 Tidy Tuesday shared yarn data available on Ravelry to the data science community for analysis.
The code I reviewed for my analysis comes from GitHub user Alice Walsh
Original Data Set
The Data Available from Ravelry consists of many variables of interest. For my analysis I chose to primarily look at the average rating, yarn weight name and the company that sells the specific yarns.
I have been interested in learning more about different yarn companies so I chose to look at which brands had the highest number of perfect ratings in the data set.
I began by filtering only the yarns with a perfect rating and summarized the number of yarns each company had. Some brands only had a few yarns that fit that requirement so I narrowed my scope to only those with more than thirty perfectly rated yarns.
My final graphical analysis shows that ColourMart, Ice Yarns, and Lana Grossa have the highest number of 5 point reviews for their yarns
Show the code
ggplot() +geom_col(data = high_ratings,aes(y =fct_reorder(yarn_company_name, totalnumber), x = totalnumber), fill ="grey") +geom_col(data = top3, aes(y =fct_reorder(yarn_company_name, totalnumber), x = totalnumber), fill ="darkred") +labs(title="Which Yarn Company has the highest number of perfect ratings?",x ="Total Number of 5 star ratings",y ="Company Name" ) +theme_minimal()
Wickham, Hadley, Mara Averick, Jennifer Bryan, Winston Chang, Lucy D’Agostino McGowan, Romain François, Garrett Grolemund, et al. 2019. “Welcome to the tidyverse.”Journal of Open Source Software 4 (43): 1686. https://doi.org/10.21105/joss.01686.
Wickham, Hadley, Winston Chang, Lionel Henry, Thomas Lin Pedersen, Kohske Takahashi, Claus Wilke, Kara Woo, Hiroaki Yutani, and Dewey Dunnington. 2022. Ggplot2: Create Elegant Data Visualisations Using the Grammar of Graphics. https://CRAN.R-project.org/package=ggplot2.
Wickham, Hadley, Romain François, Lionel Henry, Kirill Müller, and Davis Vaughan. 2023. Dplyr: A Grammar of Data Manipulation. https://CRAN.R-project.org/package=dplyr.