Filling Gaps in Publishing Data: New York Times Best Sellers
Publication Date
Spring 4-16-2025
Presentation Length
Poster/Gallery presentation
College
College of Sciences & Mathematics
Department
Math and Computer Science, Department of
Student Level
Undergraduate
SPARK Category
Research
Faculty Advisor
Dr. Christina Davis
SPARK Session
11:45-12:45
Presentation Type
Poster
Summary
Continuing my previous work assembling datasets from the New York Times Best Sellers of the year 2023, this project fills gaps in the data with Selenium, a python web scraping tool, to extract missing book details and descriptions. Natural language processing is then applied to build a classification model for assigning genres to each book on the lists based off of their descriptions.
Recommended Citation
Aebi, Katherine E., "Filling Gaps in Publishing Data: New York Times Best Sellers" (2025). SPARK Symposium Presentations. 76.
https://repository.belmont.edu/spark_presentations/76