Spatial Regularization for Analysis of Text and Epidemiological Data
dc.contributor.advisor | Vucetic, Slobodan | |
dc.creator | MAITI, ANIRUDDHA | |
dc.date.accessioned | 2022-08-15T19:03:43Z | |
dc.date.available | 2022-08-15T19:03:43Z | |
dc.date.issued | 2022 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12613/8036 | |
dc.description.abstract | Use of spatial data has become an important aspect of data analysis. Use of location information can provide useful insight into the dataset. Advancement of sensor technologies and improved data connectivity have made it possible to the generation of large amounts of passively generated user location data. Apart from passively generated data from users, explicit effort has been made by commercial vendors to curate large amounts of location related data such as residential histories from a variety of sources such as credit records, litigation data, driving license records etc. Such spatial data, when linked with other datasets can provide useful insights. In this dissertation, we show that spatial information of data enables us to derive useful insights in domains of text analysis and epidemiology. We investigated primarily two types of data having spatial information - text data with location information and disease related data having residential address information. We show that in the case of text data, spatial information helps us find spatially informative topics. In the case of epidemiological data, we show residential information can be used to identify high risk spatial regions. There are instances where a primary analysis is not sufficient to establish a statistically robust conclusion. For instance, in domains such as epidemiology, where a finding is not considered to be relevant unless some statistical significance is established. We proposed techniques for significant tests which can be applied to text analysis, topic modelling, and disease mapping tasks in order to establish significance of the findings. | |
dc.format.extent | 94 pages | |
dc.language.iso | eng | |
dc.publisher | Temple University. Libraries | |
dc.relation.ispartof | Theses and Dissertations | |
dc.rights | IN COPYRIGHT- This Rights Statement can be used for an Item that is in copyright. Using this statement implies that the organization making this Item available has determined that the Item is in copyright and either is the rights-holder, has obtained permission from the rights-holder(s) to make their Work(s) available, or makes the Item available under an exception or limitation to copyright (including Fair Use) that entitles it to make the Item available. | |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | |
dc.subject | Computer science | |
dc.subject | Hypothesis testing | |
dc.subject | Microblog data | |
dc.subject | Residential history data | |
dc.subject | Spatial epidemiology | |
dc.subject | Spatial text analysis | |
dc.subject | Topic modelling | |
dc.title | Spatial Regularization for Analysis of Text and Epidemiological Data | |
dc.type | Text | |
dc.type.genre | Thesis/Dissertation | |
dc.contributor.committeemember | Obradovic, Zoran | |
dc.contributor.committeemember | Vucetic, Slobodan | |
dc.contributor.committeemember | Dragut, Eduard Constantin | |
dc.contributor.committeemember | Henry, Kevin A. | |
dc.description.department | Computer and Information Science | |
dc.relation.doi | http://dx.doi.org/10.34944/dspace/8008 | |
dc.ada.note | For Americans with Disabilities Act (ADA) accommodation, including help with reading this content, please contact scholarshare@temple.edu | |
dc.description.degree | Ph.D. | |
dc.identifier.proqst | 14917 | |
dc.creator.orcid | 0000-0002-1142-6344 | |
dc.date.updated | 2022-08-11T22:08:12Z | |
refterms.dateFOA | 2022-08-15T19:03:44Z | |
dc.identifier.filename | Maiti_temple_0225E_14917.pdf |