Loading...
Thumbnail Image
Item

A Comparison of Topic Modeling Approaches Using Networked Discussion Forum Posts From the City-data.com Corpus

Omizo, Ryan M.
Citations
Altmetric:
Genre
Journal article
Date
2024-02-07
Advisor
Committee member
Group
Department
English
Permanent link to this record
Research Projects
Organizational Units
Journal Issue
DOI
https://doi.org/10.5334/johd.182
Abstract
The City-Data.com Corpus provides over 15,000 discussion forum posts scraped from city-data.com--a website that hosts information about cities across the United States. Like the 20 Newsgroups dataset, the City-Data.com Corpus is weakly labeled by forum topics and thread titles and can be used to trial natural language processing techniques or be used to stage lessons in digital textual analysis in digital humanities pedagogy.
Description
Citation
Omizo, R.M. (2024) ‘A Comparison of Topic Modeling Approaches Using Networked Discussion Forum Posts From the City-data.com Corpus’, <i>Journal of Open Humanities Data</i>, 10(1), p. 16. Available at: https://doi.org/10.5334/johd.182.
Citation to related work
Ubiquity Press
Has part
Journal of Open Humanities Data, Vol. 10
ADA compliance
For Americans with Disabilities Act (ADA) accommodation, including help with reading this content, please contact scholarshare@temple.edu
Embedded videos
License
Attribution CC BY