語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Using structural information in mach...
~
Princeton University.
FindBook
Google Book
Amazon
博客來
Using structural information in machine learning applications .
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
Using structural information in machine learning applications ./
作者:
Barutcuoglu, Zafer.
面頁冊數:
97 p.
附註:
Adviser: Robert E. Schapire.
Contained By:
Dissertation Abstracts International69-08B.
標題:
Artificial Intelligence. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3324284
ISBN:
9780549764670
Using structural information in machine learning applications .
Barutcuoglu, Zafer.
Using structural information in machine learning applications .
- 97 p.
Adviser: Robert E. Schapire.
Thesis (Ph.D.)--Princeton University, 2008.
Classification problems encountered in real-life applications often have domain-specific structural information available on the measured data, which cannot be readily accommodated by conventional machine learning algorithms. Ignoring the structure and blindly running a conventional algorithm on the numerical data can compromise the quality of solutions.
ISBN: 9780549764670Subjects--Topical Terms:
769149
Artificial Intelligence.
Using structural information in machine learning applications .
LDR
:04249nmm 2200325 a 45
001
867645
005
20100804
008
100804s2008 ||||||||||||||||| ||eng d
020
$a
9780549764670
035
$a
(UMI)AAI3324284
035
$a
AAI3324284
040
$a
UMI
$c
UMI
100
1
$a
Barutcuoglu, Zafer.
$3
1036391
245
1 0
$a
Using structural information in machine learning applications .
300
$a
97 p.
500
$a
Adviser: Robert E. Schapire.
500
$a
Source: Dissertation Abstracts International, Volume: 69-08, Section: B, page: 4832.
502
$a
Thesis (Ph.D.)--Princeton University, 2008.
520
$a
Classification problems encountered in real-life applications often have domain-specific structural information available on the measured data, which cannot be readily accommodated by conventional machine learning algorithms. Ignoring the structure and blindly running a conventional algorithm on the numerical data can compromise the quality of solutions.
520
$a
This thesis provides answers to two such complementary settings: one where there is a hierarchy among multiple class labels (output structure), and one where the input features are known to be sequentially correlated (input structure). Probabilistic graphical models are used to encode the dependencies, and model parameters are estimated using efficient inference algorithms. While both scenarios are motivated by real bioinformatics problems, namely gene function prediction and aneuploidy-based cancer classification, they have applications in other domains as well, such as computer graphics, music, and text classification.
520
$a
The first part focuses on structure among a group of output classes. Large numbers of overlapping classes are found to be organized in hierarchies in many domains. In multi-label classification over such a hierarchy, members of a class must also belong to all of its parents. Training an independent classifier for each class is a common approach, but this may yield labels for a given example that collectively violate this constraint. We propose a principled method of resolving such inconsistencies to increase accuracy over all classes. Our approach is to view the hierarchy as a graphical model, and then to employ Bayesian inference to infer the most likely set of hierarchically consistent class labels from independent base classifier predictions. This method is applicable over any type of base classification algorithm. Experiments on synthetic data, as well as real data sets from bioinformatics and computer graphics domains, illustrate its behavior under a range of conditions, and demonstrate that it is able to improve accuracy at all levels of a hierarchy.
520
$a
The second part focuses on structure among input features, in the form of a sequential relationship. Generic non-sequential machine learning models assume no importance in the order of inputs. Conversely, sequence models (e.g. Hidden Markov Models) need to assume stationarity to keep the number of parameters manageable, modeling only sequence-wide stability and losing the significance of particular positions. We propose a fixed-length sequence classification method that combines sequential correlations with positional features in a sparsely regularized solution, with training and inference algorithms in linear-time of sequence length. Motivated by the problem of tumor classification by genetic copy number changes, our method can identify copy number alteration regions in noisy array-CGH data, and locate the genes of clinical relevance driving these alterations and affecting the cancer label. Experiments on synthetic array-CGH data modeled from real human breast tumors, as well as real tumor datasets from breast cancer, bladder cancer, and uveal melanoma, demonstrate that the our method matches or exceeds state-of-the-art methods in accuracy, and is able to produce biologically significant predictions for clinically relevant genes.
590
$a
School code: 0181.
650
4
$a
Artificial Intelligence.
$3
769149
650
4
$a
Biology, Bioinformatics.
$3
1018415
650
4
$a
Computer Science.
$3
626642
690
$a
0715
690
$a
0800
690
$a
0984
710
2
$a
Princeton University.
$3
645579
773
0
$t
Dissertation Abstracts International
$g
69-08B.
790
$a
0181
790
1 0
$a
Schapire, Robert E.,
$e
advisor
791
$a
Ph.D.
792
$a
2008
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3324284
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9078623
電子資源
11.線上閱覽_V
電子書
EB W9078623
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入