Skip to content

Data collection, cleaning, and analysis for Senior Thesis Ver. 1

Notifications You must be signed in to change notification settings

ek8terina/TikTokDataAnalysisCodingSample

Repository files navigation

TikTokDataCodingSample

Data collection, cleaning, and analysis for Senior Thesis Ver. 1

This repository represents all the underlying code of my senior thesis: "User-attempted Algorithm Control on TikTok- Disability Awareness and Blackout Day" version 1. Data collection was scraped in Python through the use of TikTokApi package by David Teather (code can be found on GitHub) slightly modified in order to ensure compatibility of mac OS Big Sur while the majority of cleaning and analysis was done in R. Both pdf versions of exclusively data figures/visualizations as well as the version 1 thesis for convenient comparison.

data_preparation.py represents selection of data attributes to be collected from any given TikTok

data_collection.py scrapes selected TikTok attributes

Data_Cleaning.R takes raw data and writes cleaned data adding day variables among others

Data_Analysis.Rmd conducts DiD analysis between selected communities, checks trends to ensure decent comparability, and constructs a few additional tables for simple data visualization

Data_Analysis.pdf is the knitted version of the above R markdown file

Final Work Sample... is the pdf version of thesis version 1. Includes all figures created and referenced in Data_Analysis

Thank you to David Teather for his TikTokApi (https://github.com/davidteather/TikTok-Api)

About

Data collection, cleaning, and analysis for Senior Thesis Ver. 1

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published