False Textual Information Detection - Towards Building a Truth Machine.
Record type: Bibliographic - electronic resource : Monograph/item
Title: False Textual Information Detection - Towards Building a Truth Machine.
Author: Yang, Fan.
Publisher: Ann Arbor : ProQuest Dissertations & Theses, 2020
Extent: 130 p.
Notes: Source: Dissertations Abstracts International, Volume: 82-07, Section: B.
Contained By: Dissertations Abstracts International 82-07B.
Subject: Computer science.
Electronic resource: https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28182744
ISBN: 9798678155900
Thesis (Ph.D.)--University of Houston, 2020.
This item must not be sold to any third party vendors.
With social media growing dominant, false information, such as questionable claims and fake news, diffuses fast, and falsehood can spread faster and more broadly than truth. Detecting false information is one of the most elusive and long-standing challenges. This calls for building a "truth machine" that automatically debunks false information. Although existing works have developed methods to prevent false information, challenges remain. For example, previous works demand a large amount of annotated data and related evidence, underestimating the difficulty of evidence linking and the cost of manual annotation. Moreover, since many works rely on evidence to determine the credibility of claims, situations in which no evidence or only noisy evidence is provided need to be addressed carefully. This thesis aims to improve the detection of false textual information in four respects:
1. We first target sentiment classification, because previous works show that leveraging sentiment can boost content-based rumor detection. We propose a representation learning framework that incorporates both labeled and unlabeled data. We show that our model learns robust features across domains and removes domain-specific features.
2. We develop a hierarchical model with an attention mechanism so that the model reveals important insights at the paragraph level or at the sentence level. We evaluate the model on news satire detection and find that it can effectively discover satirical cues at different levels.
3. We extend evidence-aware claim verification from supervised learning to positive-unlabeled learning. This setting requires only a comparatively small number of true claims, and the remaining claims can be unlabeled. We adopt a generative adversarial network to generate pseudo negative examples and conduct a thorough analysis of selected models.
4. We pay special attention to analyzing whether estimating entailment between evidence and claim helps not only in verifying the claim but also in the preliminary step of retrieving the necessary evidence. We find that entailment indeed improves evidence ranking, as long as the entailment model produces reliable outputs.
ISBN: 9798678155900
Subjects--Topical Terms: 523869 Computer science.
Subjects--Index Terms: False information detection
LDR 03541nmm a2200433 4500
001 2284713
005 20211124102945.5
008 220723s2020 ||||||||||||||||| ||eng d
020 $a 9798678155900
035 $a (MiAaPQ)AAI28182744
035 $a (MiAaPQ)0087vireo5170Yang
035 $a AAI28182744
040 $a MiAaPQ $c MiAaPQ
100 1 $a Yang, Fan. $3 1020735
245 1 0 $a False Textual Information Detection - Towards Building a Truth Machine.
260 1 $a Ann Arbor : $b ProQuest Dissertations & Theses, $c 2020
300 $a 130 p.
500 $a Source: Dissertations Abstracts International, Volume: 82-07, Section: B.
500 $a Advisor: Mukherjee, Arjun.
502 $a Thesis (Ph.D.)--University of Houston, 2020.
506 $a This item must not be sold to any third party vendors.
590 $a School code: 0087.
650 4 $a Computer science. $3 523869
650 4 $a Web studies. $3 2122754
650 4 $a Information science. $3 554358
653 $a False information detection
653 $a Social media
653 $a Manual annotation
653 $a Credibility of claims
653 $a Unlabeled data
653 $a Supervised learning
653 $a Adversarial network
653 $a Pseudo negative examples
690 $a 0984
690 $a 0454
690 $a 0723
690 $a 0646
710 2 $a University of Houston. $b Computer Science. $3 3555880
773 0 $t Dissertations Abstracts International $g 82-07B.
790 $a 0087
791 $a Ph.D.
792 $a 2020
793 $a English
856 4 0 $u https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28182744
Holdings (1 item):
Barcode: W9436446
Location: Electronic resources
Circulation category: 11. Online reading_V
Material type: E-book
Call number: EB
Use type: Normal
Loan status: On shelf
Holds: 0