Filling Gaps in Publishing Data: New York Times Best Sellers

Publication Date

Spring 4-16-2025

Presentation Length

Poster/Gallery presentation

College

College of Sciences & Mathematics

Department

Math and Computer Science, Department of

Student Level

Undergraduate

SPARK Category

Research

Faculty Advisor

Dr. Christina Davis

SPARK Session

11:45-12:45

Presentation Type

Poster

Summary

Continuing my previous work assembling datasets from the New York Times Best Sellers of the year 2023, this project fills gaps in the data with Selenium, a python web scraping tool, to extract missing book details and descriptions. Natural language processing is then applied to build a classification model for assigning genres to each book on the lists based off of their descriptions.

This document is currently not available here.

Share

COinS