• Login
    View Item 
    •   Home
    • Theses and Dissertations
    • Theses and Dissertations
    • View Item
    •   Home
    • Theses and Dissertations
    • Theses and Dissertations
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Browse

    All of TUScholarShareCommunitiesDateAuthorsTitlesSubjectsGenresThis CollectionDateAuthorsTitlesSubjectsGenres

    My Account

    LoginRegister

    Help

    AboutPeoplePoliciesHelp for DepositorsData DepositFAQs

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    DEEP LEARNING BASED OBJECT PERCEPTION ALGORITHM AND APPLICATION

    • CSV
    • RefMan
    • EndNote
    • BibTex
    • RefWorks
    Thumbnail
    Name:
    yang_temple_0225E_14284.pdf
    Size:
    59.55Mb
    Format:
    PDF
    Download
    Genre
    Thesis/Dissertation
    Date
    2020
    Author
    Yang, Fan
    Advisor
    Ling, Haibin
    Committee member
    Shi, Justin Y.
    Ji, Bo
    Wu, Ziyan
    Ling, Haibin
    Department
    Computer and Information Science
    Subject
    Computer science
    Permanent link to this record
    http://hdl.handle.net/20.500.12613/4716
    
    Metadata
    Show full item record
    DOI
    http://dx.doi.org/10.34944/dspace/4698
    Abstract
    Object perception as a fundamental task in computer vision has a broad of applicationin real word, such as self-driving, industrial defect inspection, intelligent agriculture. Numerous works have been studied to advance the progress of object perception. In particular, due to the powerful feature learning and representation ability of deep learning, object perception algorithm has achieved signicant progress. In the dissertation, I rst introduce the denition of object perception and its three subtasks: object detection, pose estimation, and object segmentation; then specically present our works in the three subtasks, respectively. Object detection: we conduct two works, clustered object detection in aerial image (ClusDet) and dually supervised feature pyramid for object detection and segmentation (DSFPN). The ClusDet is designed to leverage the prior that objects in aerial ( especially in trac scenario) tend to cluster in dierent scales for object detection. By comparing with evenly crop method, ClusDet can achieve superior precision with less computation load. The DSFPN is proposed to alleviate the gradient degradation or vanishing problem in feature pyramid network (FPN) for object detection and segmentation. In particular, we note that performance of the two-stage detectors do not constantly increase with the growing complexity of backbone network, which is consistent with the conclusion in \deep residual learning for image recognition". To mitigate the problem, we propose to add extra supervision signal on bottom-up path of FPN in training phase to enhance the gradient information so as to facilitate the model training. Pose estimation: a robust dynamic fusion (RDF) algorithm is proposed to deal with noisy modalities in patient body modeling. In particular, for patient body modeling, the RGB camera cannot provide sucient information because of the body covered with blanket or loosen cloth. In this case, multi-modality (e.g., RGB, thermal, depth) sensors are required to acquire complimentary information. However, dierent application may need dierent sensors. It is labor-intensive and time-consuming to train a model per an application. In addition, multi-modality images may come to various noise in deployment, so that the trained model fails to work precisely. To deal with the aforementioned issues, we propose the RDF in conjunction with a dynamic training strategy to adaptively depress the features from noisy modalities, such that the model can be trained once and deployed any of the modalities. Object segmentation: the object here refers to crack, we propose a feature pyramid and hierarchical boosting network (FPHBN) for pavement crack detection. Specically, the crack in pavement has various scales (width), based on this characteristic, we introduce a feature pyramid architecture to utilize the inherent hierarchy of deep convolution networks (DConvNets) to construct multi-scale features for multi-scale cracks. Beside, each layer of the DConvNets is not independent, to leverage this dependency, we design a hierarchical boosting module to reweight samples via the prediction from adjunct layer. With the benet of the boosting module, the proposed network can dynamically pay more attention to hard samples.
    ADA compliance
    For Americans with Disabilities Act (ADA) accommodation, including help with reading this content, please contact scholarshare@temple.edu
    Collections
    Theses and Dissertations

    entitlement

     
    DSpace software (copyright © 2002 - 2023)  DuraSpace
    Temple University Libraries | 1900 N. 13th Street | Philadelphia, PA 19122
    (215) 204-8212 | scholarshare@temple.edu
    Open Repository is a service operated by 
    Atmire NV
     

    Export search results

    The export option will allow you to export the current search results of the entered query to a file. Different formats are available for download. To export the items, click on the button corresponding with the preferred download format.

    By default, clicking on the export buttons will result in a download of the allowed maximum amount of items.

    To select a subset of the search results, click "Selective Export" button and make a selection of the items you want to export. The amount of items that can be exported at once is similarly restricted as the full export.

    After making a selection, click one of the export format buttons. The amount of items that will be exported is indicated in the bubble next to export format.