Show simple item record

dc.contributor.advisorLing, Haibin
dc.creatorChu, Peng
dc.date.accessioned2020-11-03T16:23:37Z
dc.date.available2020-11-03T16:23:37Z
dc.date.issued2020
dc.identifier.urihttp://hdl.handle.net/20.500.12613/2698
dc.description.abstractTracking multiple objects in video is critical for many applications, ranging from vision-based surveillance to autonomous driving. The popular solution to Multiple Object Tracking (MOT) is the tracking-by-detection strategy, in which, detections of each frame from an external detector are associated and connected to form target trajectories in either online or offline batch mode. Following this strategy, the challenges of robust tracking comes mainly from three aspects: discrimination of the appearance similar targets; handling of the noise from input detections; unifying the separated function modules for generalizability. Recently, deep neural network (DNN) has demonstrate its ability to automatically learn discriminative features from training samples thus achieves success in various computer vision tasks. My research works are to leverage this powerful learning ability of DNN to tackle the above challenges for robust MOT in real world application. In this dissertation, I first introduce the popular framework of MOT system, the datasets, the evaluation metric and challenges in MOT. Then I discuss a work that encodes the structure prior of curvilinear structures in the rank-1 tensor approximation tracking framework to reduce the ambiguity rising from indistinguishable curvilinear structures parts. This work uses convolutional neural network to generate more reliable candidates for tracking and consequently improves the tracking robustness. In the third chapter, I present a work that adapts the DNN based Single Object Tracking (SOT) techniques for missing detection recovery. SOT tracker in this work merges the originally separated feature extraction and similarity evaluation as an integrated affinity estimator. Learning of the integrated affinity estimator requires dedicated affinity samples to be manually fabricated from ground truth association, which usually does not guarantee the consistent data distribution between training and inference phases. In Chapter 4, FAMNet is proposed to integrate feature extraction, affinity estimation and multi-dimensional assignment into a unified DNN to realize end-to-end learning, which demonstrates its capability in different target categories and tracking scenarios in our comprehensive experiments. On the other hand, training of DNN usually requires large amount of labeled data which is not always available in the tracking tasks. To tackle this problem, in Chapter 5, I present a work using transfer learning and multi-task scheme to facilitate the feature learning in the context of limited training data. Finally, we summarize with the discussion of future works including DNN also integrating detector for MOT and other possible MOT frameworks such as model-free MOT tracker.
dc.format.extent119 pages
dc.language.isoeng
dc.publisherTemple University. Libraries
dc.relation.ispartofTheses and Dissertations
dc.rightsIN COPYRIGHT- This Rights Statement can be used for an Item that is in copyright. Using this statement implies that the organization making this Item available has determined that the Item is in copyright and either is the rights-holder, has obtained permission from the rights-holder(s) to make their Work(s) available, or makes the Item available under an exception or limitation to copyright (including Fair Use) that entitles it to make the Item available.
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/
dc.subjectComputer Science
dc.titleDeep Neural Network for Robust Multiple Object Tracking
dc.typeText
dc.type.genreThesis/Dissertation
dc.contributor.committeememberSouvenir, Richard M.
dc.contributor.committeememberYang, Jie
dc.contributor.committeememberZhang, Kai
dc.description.departmentComputer and Information Science
dc.relation.doihttp://dx.doi.org/10.34944/dspace/2680
dc.ada.noteFor Americans with Disabilities Act (ADA) accommodation, including help with reading this content, please contact scholarshare@temple.edu
dc.description.degreePh.D.
refterms.dateFOA2020-11-03T16:23:37Z


Files in this item

Thumbnail
Name:
TETDEDXChu-temple-0225E-13991.pdf
Size:
14.28Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record