Language:
English
繁體中文
Help
回圖書館首頁
手機版館藏查詢
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Activating Big Data at Scale.
~
Wang, Xikui.
Linked to FindBook
Google Book
Amazon
博客來
Activating Big Data at Scale.
Record Type:
Electronic resources : Monograph/item
Title/Author:
Activating Big Data at Scale./
Author:
Wang, Xikui.
Published:
Ann Arbor : ProQuest Dissertations & Theses, : 2020,
Description:
181 p.
Notes:
Source: Dissertations Abstracts International, Volume: 82-06, Section: B.
Contained By:
Dissertations Abstracts International82-06B.
Subject:
Computer science. -
Online resource:
https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28090430
ISBN:
9798557021388
Activating Big Data at Scale.
Wang, Xikui.
Activating Big Data at Scale.
- Ann Arbor : ProQuest Dissertations & Theses, 2020 - 181 p.
Source: Dissertations Abstracts International, Volume: 82-06, Section: B.
Thesis (Ph.D.)--University of California, Irvine, 2020.
This item must not be sold to any third party vendors.
With our world being more digitized than ever, handling Big Data has become a fundamental challenge in building modern applications and services. Although both academia and industry have developed a plethora of systems in recent years to help developers working with Big Data, many of them still follow the pattern of passively responding to users' queries, rather than processing and delivering data to interested users actively. We need systems for activating Big Data at scale and reducing the users' effort in working with Big Active Data.In this dissertation, we explore three problems related to activating Big Data at scale. We first investigate the problem of enabling data enrichment during data ingestion. We discuss the needs and challenges in enriching data during data ingestion, and we introduce a new ingestion framework into AsterixDB - dynamic data feeds - that supports complex data enrichment functions and captures relevant data changes in the system during ingestion. We show the design and implementation of the new ingestion framework and evaluate its performance using different enrichment use cases.Then, we look at the Big Active Data (BAD) challenge. We describe a BAD world that consists of different types of users and requests, and we propose a BAD system for providing BAD services for BAD users. We first review the initial prototype of the BAD system - BAD-RQ - and we discuss its limitations in BAD continuous use cases. We introduce a new BAD service - BAD-CQ - for providing continuous query semantics in the BAD system. Further, we use an alternative system constructed by gluing together multiple existing Big Data systems to show the challenges in providing BAD services without the BAD system. We measure the performance of BAD-CQ with various workloads and compare that with the alternative system's performance.Last but not least, we study how to allow users to declaratively create scalable data sharing services between multiple BAD system instances without having to create and manage dedicated programs/services. We describe the notion of BAD islands that consist of multiple BAD instances and introduce new features to the BAD system for "bridging" multiple BAD instances together. We use a sample use case to illustrate how to create bridges between different BAD systems. To this end, we present a demonstration system that also involves the use of dynamic data feeds and BAD-CQ to show how BAD islands work.
ISBN: 9798557021388Subjects--Topical Terms:
523869
Computer science.
Subjects--Index Terms:
Big active data
Activating Big Data at Scale.
LDR
:03641nmm a2200409 4500
001
2278705
005
20210712062248.5
008
220723s2020 ||||||||||||||||| ||eng d
020
$a
9798557021388
035
$a
(MiAaPQ)AAI28090430
035
$a
AAI28090430
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Wang, Xikui.
$3
3557093
245
1 0
$a
Activating Big Data at Scale.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2020
300
$a
181 p.
500
$a
Source: Dissertations Abstracts International, Volume: 82-06, Section: B.
500
$a
Advisor: Carey, Michael J.
502
$a
Thesis (Ph.D.)--University of California, Irvine, 2020.
506
$a
This item must not be sold to any third party vendors.
520
$a
With our world being more digitized than ever, handling Big Data has become a fundamental challenge in building modern applications and services. Although both academia and industry have developed a plethora of systems in recent years to help developers working with Big Data, many of them still follow the pattern of passively responding to users' queries, rather than processing and delivering data to interested users actively. We need systems for activating Big Data at scale and reducing the users' effort in working with Big Active Data.In this dissertation, we explore three problems related to activating Big Data at scale. We first investigate the problem of enabling data enrichment during data ingestion. We discuss the needs and challenges in enriching data during data ingestion, and we introduce a new ingestion framework into AsterixDB - dynamic data feeds - that supports complex data enrichment functions and captures relevant data changes in the system during ingestion. We show the design and implementation of the new ingestion framework and evaluate its performance using different enrichment use cases.Then, we look at the Big Active Data (BAD) challenge. We describe a BAD world that consists of different types of users and requests, and we propose a BAD system for providing BAD services for BAD users. We first review the initial prototype of the BAD system - BAD-RQ - and we discuss its limitations in BAD continuous use cases. We introduce a new BAD service - BAD-CQ - for providing continuous query semantics in the BAD system. Further, we use an alternative system constructed by gluing together multiple existing Big Data systems to show the challenges in providing BAD services without the BAD system. We measure the performance of BAD-CQ with various workloads and compare that with the alternative system's performance.Last but not least, we study how to allow users to declaratively create scalable data sharing services between multiple BAD system instances without having to create and manage dedicated programs/services. We describe the notion of BAD islands that consist of multiple BAD instances and introduce new features to the BAD system for "bridging" multiple BAD instances together. We use a sample use case to illustrate how to create bridges between different BAD systems. To this end, we present a demonstration system that also involves the use of dynamic data feeds and BAD-CQ to show how BAD islands work.
590
$a
School code: 0030.
650
4
$a
Computer science.
$3
523869
650
4
$a
Business administration.
$3
3168311
650
4
$a
Systems science.
$3
3168411
650
4
$a
Information technology.
$3
532993
650
4
$a
Information science.
$3
554358
653
$a
Big active data
653
$a
Cloud computing
653
$a
Data warehouses
653
$a
Databases
653
$a
Distributed systems
690
$a
0984
690
$a
0489
690
$a
0310
690
$a
0454
690
$a
0723
690
$a
0790
710
2
$a
University of California, Irvine.
$b
Computer Science - Ph.D..
$3
2099281
773
0
$t
Dissertations Abstracts International
$g
82-06B.
790
$a
0030
791
$a
Ph.D.
792
$a
2020
793
$a
English
856
4 0
$u
https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28090430
based on 0 review(s)
Location:
ALL
電子資源
Year:
Volume Number:
Items
1 records • Pages 1 •
1
Inventory Number
Location Name
Item Class
Material type
Call number
Usage Class
Loan Status
No. of reservations
Opac note
Attachments
W9430438
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
On shelf
0
1 records • Pages 1 •
1
Multimedia
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login