• Login
    View Item 
    •   Home
    • Theses and Dissertations
    • Theses and Dissertations
    • View Item
    •   Home
    • Theses and Dissertations
    • Theses and Dissertations
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Browse

    All of TUScholarShareCommunitiesDateAuthorsTitlesSubjectsGenresThis CollectionDateAuthorsTitlesSubjectsGenres

    My Account

    LoginRegister

    Help

    AboutPeoplePoliciesHelp for DepositorsData DepositFAQs

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    NOVEL DATA MINING ALGORITHMS FOR ANALYSIS OF ELECTRONIC HEALTH RECORDS

    • CSV
    • RefMan
    • EndNote
    • BibTex
    • RefWorks
    Thumbnail
    Name:
    Chanda_temple_0225E_15028.pdf
    Size:
    1.162Mb
    Format:
    PDF
    Download
    Genre
    Thesis/Dissertation
    Date
    2022
    Author
    Chanda, Ashis cc
    Advisor
    Vucetic, Slobodan
    Committee member
    Obradovic, Zoran
    Dragut, Eduard Constantin
    Department
    Computer and Information Science
    Subject
    Computer science
    Artificial intelligence
    Data mining
    Data visualization
    Electronic health records
    Machine learning
    Natural language processing
    Neural network
    Permanent link to this record
    http://hdl.handle.net/20.500.12613/8340
    
    Metadata
    Show full item record
    DOI
    http://dx.doi.org/10.34944/dspace/8311
    Abstract
    Medical health providers use electronic health records (EHRs) to store information about patient treatment to support patient care management and securely share health information among healthcare organizations. EHRs have also been used in healthcare research in problems such as patient phenotyping, health risk prediction, and medical entity extraction. In this thesis, we focus on several important issues: (1) how to convert natural text from medical notes to vector representations suitable for deep learning algorithms, (2) how to help healthcare researchers select a patient cohort from EHRs, and (3) how to use EHRs to identify patient diagnoses and treatments. In the first part of the thesis, we present a new method for learning vector representations of medical terms. Learning vector representations of words is an important pre-processing step in many natural language processing applications. For example, EHRs contain clinical notes that describe patient health conditions and course of treatment in a narrative style. The notes contain specialized medical terminology and many abbreviations. Learning good vector representations of specialized medical terms can improve the quality of downstream data analysis tasks on EHR data. However, the traditional approaches struggle to learn vector representations of rarely used medical terms. To overcome this problem, we developed a neural network-based approach, called definition2vec, that uses external knowledge contained in medical vocabularies. We performed quantitative and qualitative analysis to measure the usefulness of the learned representations. The results demonstrate that definition2vec is superior to the state-of-the-art algorithms. In the second part of the thesis, we describe a new visual interface that helps healthcare researchers select patient cohorts from EHR data. Process of identifying patients of interest for observational studies from EHR data is known as cohort selection, a challenging research problem. We considered a problem of cohort selection from medical claim data, which requires identifying a set of medical codes for selection. However, there are tens of thousands of unique medical codes, and it becomes very difficult for any human to decide which codes identify patients of interest. To help users in defining a set of codes for cohort identification, we developed an interactive system, called Medical Claim Visualization system (MedCV), which visualizes medical code representations. MedCV analyzes a medical claim database and allows users to reason about medical code relationships and define inclusion rules for the selection by visualizing medical codes, claims, and patient timelines. Evaluation of our system through a user study indicates that MedCV enables domain experts to define inclusion rules efficiently and with high quality. The third part of the thesis is a study of the definition of acute kidney injury (AKI), which is a condition where kidneys suddenly cannot filter waste from the blood. AKI is a major cause of patient death in intensive care units (ICU) and it is critical to detect it early. Recently published KDIGO medical guideline proposed a clinical definition of AKI using blood serum creatinine and urine output. The KDIGO definition was developed based on the expert knowledge, but very little is known about how well it matches the medical practice. In this study, we investigated publicly available EHR data from 47,499 ICU admissions to determine the concordance between the KDIGO definition and AKI determination by the medical provider. We show that it is possible to find a formula using machine learning with much higher concordance with the medical provider AKI coding than KDIGO and discuss the medical relevance of this finding.
    ADA compliance
    For Americans with Disabilities Act (ADA) accommodation, including help with reading this content, please contact scholarshare@temple.edu
    Collections
    Theses and Dissertations

    entitlement

     
    DSpace software (copyright © 2002 - 2023)  DuraSpace
    Temple University Libraries | 1900 N. 13th Street | Philadelphia, PA 19122
    (215) 204-8212 | scholarshare@temple.edu
    Open Repository is a service operated by 
    Atmire NV
     

    Export search results

    The export option will allow you to export the current search results of the entered query to a file. Different formats are available for download. To export the items, click on the button corresponding with the preferred download format.

    By default, clicking on the export buttons will result in a download of the allowed maximum amount of items.

    To select a subset of the search results, click "Selective Export" button and make a selection of the items you want to export. The amount of items that can be exported at once is similarly restricted as the full export.

    After making a selection, click one of the export format buttons. The amount of items that will be exported is indicated in the bubble next to export format.