東華大學圖書館 |

Language: English

Help

回圖書館首頁

手機版館藏查詢

Back

Switch To: Labeled | MARC Mode | ISBD

Joint Multiple Visual Task Understan...

Wang, Peng.

Linked to FindBook

Google Book

Amazon

博客來

Joint Multiple Visual Task Understanding from a Single Image via Deep Learning and Conditional Random Field.

Record Type:	Electronic resources : Monograph/item
Title/Author:	Joint Multiple Visual Task Understanding from a Single Image via Deep Learning and Conditional Random Field./
Author:	Wang, Peng.
Published:	Ann Arbor : ProQuest Dissertations & Theses, : 2017,
Description:	99 p.
Notes:	Source: Dissertation Abstracts International, Volume: 78-08(E), Section: B.
Contained By:	Dissertation Abstracts International78-08B(E).
Subject:	Statistics. -
Online resource:	http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10259737
ISBN:	9781369657098

Joint Multiple Visual Task Understanding from a Single Image via Deep Learning and Conditional Random Field.
Wang, Peng.

Joint Multiple Visual Task Understanding from a Single Image via Deep Learning and Conditional Random Field. - Ann Arbor : ProQuest Dissertations & Theses, 2017 - 99 p.

Source: Dissertation Abstracts International, Volume: 78-08(E), Section: B.

Thesis (Ph.D.)--University of California, Los Angeles, 2017.

Human are interpolating the visual world with very rich understanding. For example, when observing the world through eyes, we not only understand the high level semantic meaning of each region/pixel, more importantly, we also understand the 3D properties like how far away each object is and how the 3D shape of each object is in order to do interaction with the world. In the field of computer vision, however, visual understanding are separated into multiple tasks, e.g. segmentation, 3D reconstruction or object detection etc., due to its high complexity. However, this induces the problem that the results from different strategies are lack of compatibility among different tasks. For example, semantic object detection can not take care of the 3D occlusion regions, while 3D reconstruction does not consider overall semantic context. Thus, in order to have good visual understanding, it is critical to joint understand different tasks while maintaining their compatibility.

ISBN: 9781369657098Subjects--Topical Terms:

517247
Statistics.

Joint Multiple Visual Task Understanding from a Single Image via Deep Learning and Conditional Random Field.
LDR:03800nmm a2200337 4500 001 2159843
005 20180703084808.5
008 190424s2017 ||||||||||||||||| ||eng d
020 $a 9781369657098
035 $a (MiAaPQ)AAI10259737
035 $a (MiAaPQ)ucla:15243
035 $a AAI10259737
040 $a MiAaPQ $c MiAaPQ
100 1 $a Wang, Peng. $3 1271684
245 1 0 $a Joint Multiple Visual Task Understanding from a Single Image via Deep Learning and Conditional Random Field.
260 1 $a Ann Arbor : $b ProQuest Dissertations & Theses, $c 2017
300 $a 99 p.
500 $a Source: Dissertation Abstracts International, Volume: 78-08(E), Section: B.
500 $a Adviser: Alan Loddon Yuille.
502 $a Thesis (Ph.D.)--University of California, Los Angeles, 2017.
520 $a Human are interpolating the visual world with very rich understanding. For example, when observing the world through eyes, we not only understand the high level semantic meaning of each region/pixel, more importantly, we also understand the 3D properties like how far away each object is and how the 3D shape of each object is in order to do interaction with the world. In the field of computer vision, however, visual understanding are separated into multiple tasks, e.g. segmentation, 3D reconstruction or object detection etc., due to its high complexity. However, this induces the problem that the results from different strategies are lack of compatibility among different tasks. For example, semantic object detection can not take care of the 3D occlusion regions, while 3D reconstruction does not consider overall semantic context. Thus, in order to have good visual understanding, it is critical to joint understand different tasks while maintaining their compatibility.
520 $a Luckily, thanks to the raising technique of deep learning, (a.k.a. convolutional neural network (CNN)), which dramatically beats the other traditional strategies in many visual tasks based on hierarchical learned features with a nearly single framework, we are able to unify different understandings in a more compact and efficient way by designing reasonable output and interaction terms.
520 $a However, CNN is not a magic key of solving all problems, and one obvious limitation of CNN is that it contains arbitrarily selected convolutional kernel size and layers, yielding non-adaptive receptive fields to match the variance of object scales. In addition, it is not strait-forward to add arbitrary connections inside each layer based on intuition. Thus, we further embed the conditional random field (CRF) into the system in order to compensate the deficiency in order to unify different cues and perform multiple tasks simultaneously.
520 $a In this thesis, we prove the concept through estimating multiple tasks jointly including joint part and object segmentation, joint segmentation and geometry estimation etc. We first show that we can fit deep convolutional network into many different tasks to acquire superior performance compare to traditional shallow features. Secondly, by unifying different tasks with our designed compatibility constrains, we make different tasks mutually regularized and beneficial. Finally, to evaluate the results, we perform our experiments over the standard evaluating benchmarks like PASCAL for segmentation and the NYU v2 dataset for depth estimation. Last but not the least, we not only apply the existing metrics to show the performance gain from our design, but also introduce reasonable new metrics in order to better show the aspect that improved.
590 $a School code: 0031.
650 4 $a Statistics. $3 517247
650 4 $a Computer science. $3 523869
690 $a 0463
690 $a 0984
710 2 $a University of California, Los Angeles. $b Statistics 0891. $3 2095317
773 0 $t Dissertation Abstracts International $g 78-08B(E).
790 $a 0031
791 $a Ph.D.
792 $a 2017
793 $a English
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10259737