語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Text mining biomedical literature fo...
~
The University of Texas at Dallas.
FindBook
Google Book
Amazon
博客來
Text mining biomedical literature for improving MEDLINE retrieval.
紀錄類型:
書目-語言資料,印刷品 : Monograph/item
正題名/作者:
Text mining biomedical literature for improving MEDLINE retrieval./
作者:
Lin, Yongjing.
面頁冊數:
179 p.
附註:
Adviser: Ying Liu.
Contained By:
Dissertation Abstracts International69-03B.
標題:
Computer Science. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoeng/servlet/advanced?query=3305851
ISBN:
9780549527039
Text mining biomedical literature for improving MEDLINE retrieval.
Lin, Yongjing.
Text mining biomedical literature for improving MEDLINE retrieval.
- 179 p.
Adviser: Ying Liu.
Thesis (Ph.D.)--The University of Texas at Dallas, 2008.
A major problem faced in biomedical informatics involves how best to present information retrieval results. This dissertation developed an approach that present users with reduced sets of relevant citations together with topic label. A text mining system is designed to group the retrieved citations, rank the citations in each cluster, and generate a set of keywords and MeSH terms to describe the common theme of each cluster.
ISBN: 9780549527039Subjects--Topical Terms:
626642
Computer Science.
Text mining biomedical literature for improving MEDLINE retrieval.
LDR
:03297nam 2200313 a 45
001
856938
005
20100709
008
100709s2008 ||||||||||||||||| ||eng d
020
$a
9780549527039
035
$a
(UMI)AAI3305851
035
$a
AAI3305851
040
$a
UMI
$c
UMI
100
1
$a
Lin, Yongjing.
$3
1023827
245
1 0
$a
Text mining biomedical literature for improving MEDLINE retrieval.
300
$a
179 p.
500
$a
Adviser: Ying Liu.
500
$a
Source: Dissertation Abstracts International, Volume: 69-03, Section: B, page: 1742.
502
$a
Thesis (Ph.D.)--The University of Texas at Dallas, 2008.
520
$a
A major problem faced in biomedical informatics involves how best to present information retrieval results. This dissertation developed an approach that present users with reduced sets of relevant citations together with topic label. A text mining system is designed to group the retrieved citations, rank the citations in each cluster, and generate a set of keywords and MeSH terms to describe the common theme of each cluster.
520
$a
A series of follow-up researches were conducted for better performance of the system. A spectral analysis clustering method was proposed based on the content similarity network techniques for information retrieval systems. The new approach organizes all these search results into categories intelligently. Our experimental results demonstrated that the presented method performs well in real world applications.
520
$a
Automated concept recognition for each cluster is one of the important tasks in our text mining system. The system can perform keyword, key MeSH term and key noun-phrase extraction. Within each cluster, the extraction of keyword and key MeSH term is based on modeling the document-term-matrix as a weighted bipartite graph. A mutual reinforcement principle is used to rank the terms. Our new key noun-phrase extraction method is based on the context-free grammatical rules extracted from the input documents. An existing algorithm called Sequitur is used for constructing the context-free grammar rules that re-represent a sequence as a hierarchical structure. Noun-phrases are extracted from the grammatical rules. The key noun-phrases were identified from top frequency rules without extracting all the grammatical rules. The experimental results showed that our key noun-phrase extraction method is effective in identifying key concepts from documents, and outperforms current widely-used methods.
520
$a
We also explored to rank MEDLINE citations using an existing web ranking algorithm, HITS (Hyperlink-Induced Topic Search) algorithm. We further extended HITS to supervised HITS to rank citations. Our results showed that supervised HITS algorithm significantly outperforms HITS algorithm (p<0.01). Compared with HITS, supervised HITS can improve citation ranking from 15% to more than 59% in almost all the cases we tested. Furthermore, MeSH terms outperforms text words in ranking citations, especially when HITS was applied (p<0.01).
590
$a
School code: 0382.
650
4
$a
Computer Science.
$3
626642
650
4
$a
Engineering, Biomedical.
$3
1017684
690
$a
0541
690
$a
0984
710
2
$a
The University of Texas at Dallas.
$3
1018411
773
0
$t
Dissertation Abstracts International
$g
69-03B.
790
$a
0382
790
1 0
$a
Liu, Ying,
$e
advisor
791
$a
Ph.D.
792
$a
2008
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoeng/servlet/advanced?query=3305851
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9072100
電子資源
11.線上閱覽_V
電子書
EB W9072100
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入