語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Joint Multiple Visual Task Understan...
~
Wang, Peng.
FindBook
Google Book
Amazon
博客來
Joint Multiple Visual Task Understanding from a Single Image via Deep Learning and Conditional Random Field.
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
Joint Multiple Visual Task Understanding from a Single Image via Deep Learning and Conditional Random Field./
作者:
Wang, Peng.
出版者:
Ann Arbor : ProQuest Dissertations & Theses, : 2017,
面頁冊數:
99 p.
附註:
Source: Dissertation Abstracts International, Volume: 78-08(E), Section: B.
Contained By:
Dissertation Abstracts International78-08B(E).
標題:
Statistics. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10259737
ISBN:
9781369657098
Joint Multiple Visual Task Understanding from a Single Image via Deep Learning and Conditional Random Field.
Wang, Peng.
Joint Multiple Visual Task Understanding from a Single Image via Deep Learning and Conditional Random Field.
- Ann Arbor : ProQuest Dissertations & Theses, 2017 - 99 p.
Source: Dissertation Abstracts International, Volume: 78-08(E), Section: B.
Thesis (Ph.D.)--University of California, Los Angeles, 2017.
Human are interpolating the visual world with very rich understanding. For example, when observing the world through eyes, we not only understand the high level semantic meaning of each region/pixel, more importantly, we also understand the 3D properties like how far away each object is and how the 3D shape of each object is in order to do interaction with the world. In the field of computer vision, however, visual understanding are separated into multiple tasks, e.g. segmentation, 3D reconstruction or object detection etc., due to its high complexity. However, this induces the problem that the results from different strategies are lack of compatibility among different tasks. For example, semantic object detection can not take care of the 3D occlusion regions, while 3D reconstruction does not consider overall semantic context. Thus, in order to have good visual understanding, it is critical to joint understand different tasks while maintaining their compatibility.
ISBN: 9781369657098Subjects--Topical Terms:
517247
Statistics.
Joint Multiple Visual Task Understanding from a Single Image via Deep Learning and Conditional Random Field.
LDR
:03800nmm a2200337 4500
001
2159843
005
20180703084808.5
008
190424s2017 ||||||||||||||||| ||eng d
020
$a
9781369657098
035
$a
(MiAaPQ)AAI10259737
035
$a
(MiAaPQ)ucla:15243
035
$a
AAI10259737
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Wang, Peng.
$3
1271684
245
1 0
$a
Joint Multiple Visual Task Understanding from a Single Image via Deep Learning and Conditional Random Field.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2017
300
$a
99 p.
500
$a
Source: Dissertation Abstracts International, Volume: 78-08(E), Section: B.
500
$a
Adviser: Alan Loddon Yuille.
502
$a
Thesis (Ph.D.)--University of California, Los Angeles, 2017.
520
$a
Human are interpolating the visual world with very rich understanding. For example, when observing the world through eyes, we not only understand the high level semantic meaning of each region/pixel, more importantly, we also understand the 3D properties like how far away each object is and how the 3D shape of each object is in order to do interaction with the world. In the field of computer vision, however, visual understanding are separated into multiple tasks, e.g. segmentation, 3D reconstruction or object detection etc., due to its high complexity. However, this induces the problem that the results from different strategies are lack of compatibility among different tasks. For example, semantic object detection can not take care of the 3D occlusion regions, while 3D reconstruction does not consider overall semantic context. Thus, in order to have good visual understanding, it is critical to joint understand different tasks while maintaining their compatibility.
520
$a
Luckily, thanks to the raising technique of deep learning, (a.k.a. convolutional neural network (CNN)), which dramatically beats the other traditional strategies in many visual tasks based on hierarchical learned features with a nearly single framework, we are able to unify different understandings in a more compact and efficient way by designing reasonable output and interaction terms.
520
$a
However, CNN is not a magic key of solving all problems, and one obvious limitation of CNN is that it contains arbitrarily selected convolutional kernel size and layers, yielding non-adaptive receptive fields to match the variance of object scales. In addition, it is not strait-forward to add arbitrary connections inside each layer based on intuition. Thus, we further embed the conditional random field (CRF) into the system in order to compensate the deficiency in order to unify different cues and perform multiple tasks simultaneously.
520
$a
In this thesis, we prove the concept through estimating multiple tasks jointly including joint part and object segmentation, joint segmentation and geometry estimation etc. We first show that we can fit deep convolutional network into many different tasks to acquire superior performance compare to traditional shallow features. Secondly, by unifying different tasks with our designed compatibility constrains, we make different tasks mutually regularized and beneficial. Finally, to evaluate the results, we perform our experiments over the standard evaluating benchmarks like PASCAL for segmentation and the NYU v2 dataset for depth estimation. Last but not the least, we not only apply the existing metrics to show the performance gain from our design, but also introduce reasonable new metrics in order to better show the aspect that improved.
590
$a
School code: 0031.
650
4
$a
Statistics.
$3
517247
650
4
$a
Computer science.
$3
523869
690
$a
0463
690
$a
0984
710
2
$a
University of California, Los Angeles.
$b
Statistics 0891.
$3
2095317
773
0
$t
Dissertation Abstracts International
$g
78-08B(E).
790
$a
0031
791
$a
Ph.D.
792
$a
2017
793
$a
English
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10259737
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9359390
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入