東華大學圖書館 |

Language: English

Help

回圖書館首頁

手機版館藏查詢

Back

Switch To: Labeled | MARC Mode | ISBD

False Textual Information Detection ...

Yang, Fan.

Linked to FindBook

Google Book

Amazon

博客來

False Textual Information Detection - Towards Building a Truth Machine.

Record Type:	Electronic resources : Monograph/item
Title/Author:	False Textual Information Detection - Towards Building a Truth Machine./
Author:	Yang, Fan.
Published:	Ann Arbor : ProQuest Dissertations & Theses, : 2020,
Description:	130 p.
Notes:	Source: Dissertations Abstracts International, Volume: 82-07, Section: B.
Contained By:	Dissertations Abstracts International82-07B.
Subject:	Computer science. -
Online resource:	https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28182744
ISBN:	9798678155900

False Textual Information Detection - Towards Building a Truth Machine.
Yang, Fan.

False Textual Information Detection - Towards Building a Truth Machine. - Ann Arbor : ProQuest Dissertations & Theses, 2020 - 130 p.

Source: Dissertations Abstracts International, Volume: 82-07, Section: B.

Thesis (Ph.D.)--University of Houston, 2020.

This item must not be sold to any third party vendors.

With social media growing dominant, false information, such as questionable claims and fake news, diffuses fast. Detecting false information is one of the most elusive and long-standing challenges. With social media growing dominant, falsehood can diffuse faster and broader than truth. This calls for building a ``truth machine" that automatically debunks false information. Although existing works have developed methods to prevent false information, challenges still remain. For example, previous works demand a large amount of annotated data and related evidence, underestimating the difficulty of evidence linking and the cost of manual annotation. Besides, since a large number of works rely on evidence to determine the credibility of claims, we need to carefully address situations when no evidence or noisy evidence is provided. This thesis aims to improve detecting false textual information from four aspects: 1. we first target sentiment classification because previous works show that leveraging sentiment can boost content-based rumor detection. We propose a representation learning framework that incorporates both labeled and unlabeled data. We show that our model learns robust features across domains and removes domain-specific features. 2. we develop a hierarchical model with attention mechanism so that our model reveals important insights at the paragraph level or at the sentence level. We evaluate our model on news satire detection and find that our model can effectively discover satirical cues at different levels. 3. we extend evidence-aware claim verification from supervised learning to positive-unlabeled learning. This setting requires a comparatively small number of true claims, and more claims can be unlabeled. We adopt the generative adversarial network to generate pseudo negative examples and conduct a thorough analysis of selected models. 4. we pay special attention to analyzing whether estimating entailment between evidence and claim helps not only to verify it but also to the preliminary step of retrieving the necessary evidence. We find that entailment indeed improves evidence ranking, as far as the entailment model produces reliable outputs.

ISBN: 9798678155900Subjects--Topical Terms:

523869
Computer science.
Subjects--Index Terms:

False information detection

False Textual Information Detection - Towards Building a Truth Machine.
LDR:03541nmm a2200433 4500 001 2284713
005 20211124102945.5
008 220723s2020 ||||||||||||||||| ||eng d
020 $a 9798678155900
035 $a (MiAaPQ)AAI28182744
035 $a (MiAaPQ)0087vireo5170Yang
035 $a AAI28182744
040 $a MiAaPQ $c MiAaPQ
100 1 $a Yang, Fan. $3 1020735
245 1 0 $a False Textual Information Detection - Towards Building a Truth Machine.
260 1 $a Ann Arbor : $b ProQuest Dissertations & Theses, $c 2020
300 $a 130 p.
500 $a Source: Dissertations Abstracts International, Volume: 82-07, Section: B.
500 $a Advisor: Mukherjee, Arjun.
502 $a Thesis (Ph.D.)--University of Houston, 2020.
506 $a This item must not be sold to any third party vendors.
520 $a With social media growing dominant, false information, such as questionable claims and fake news, diffuses fast. Detecting false information is one of the most elusive and long-standing challenges. With social media growing dominant, falsehood can diffuse faster and broader than truth. This calls for building a ``truth machine" that automatically debunks false information. Although existing works have developed methods to prevent false information, challenges still remain. For example, previous works demand a large amount of annotated data and related evidence, underestimating the difficulty of evidence linking and the cost of manual annotation. Besides, since a large number of works rely on evidence to determine the credibility of claims, we need to carefully address situations when no evidence or noisy evidence is provided. This thesis aims to improve detecting false textual information from four aspects: 1. we first target sentiment classification because previous works show that leveraging sentiment can boost content-based rumor detection. We propose a representation learning framework that incorporates both labeled and unlabeled data. We show that our model learns robust features across domains and removes domain-specific features. 2. we develop a hierarchical model with attention mechanism so that our model reveals important insights at the paragraph level or at the sentence level. We evaluate our model on news satire detection and find that our model can effectively discover satirical cues at different levels. 3. we extend evidence-aware claim verification from supervised learning to positive-unlabeled learning. This setting requires a comparatively small number of true claims, and more claims can be unlabeled. We adopt the generative adversarial network to generate pseudo negative examples and conduct a thorough analysis of selected models. 4. we pay special attention to analyzing whether estimating entailment between evidence and claim helps not only to verify it but also to the preliminary step of retrieving the necessary evidence. We find that entailment indeed improves evidence ranking, as far as the entailment model produces reliable outputs.
590 $a School code: 0087.
650 4 $a Computer science. $3 523869
650 4 $a Web studies. $3 2122754
650 4 $a Information science. $3 554358
653 $a False information detection
653 $a Social media
653 $a Manual annotation
653 $a Credibility of claims
653 $a Unlabeled data
653 $a Supervised learning
653 $a Adversarial network
653 $a Pseudo negative examples
690 $a 0984
690 $a 0454
690 $a 0723
690 $a 0646
710 2 $a University of Houston. $b Computer Science. $3 3555880
773 0 $t Dissertations Abstracts International $g 82-07B.
790 $a 0087
791 $a Ph.D.
792 $a 2020
793 $a English
856 4 0 $u https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28182744