語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Never-Ending Learning of Sounds.
~
Martinez Elizalde, Benjamin.
FindBook
Google Book
Amazon
博客來
Never-Ending Learning of Sounds.
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
Never-Ending Learning of Sounds./
作者:
Martinez Elizalde, Benjamin.
出版者:
Ann Arbor : ProQuest Dissertations & Theses, : 2020,
面頁冊數:
125 p.
附註:
Source: Dissertations Abstracts International, Volume: 82-05, Section: B.
Contained By:
Dissertations Abstracts International82-05B.
標題:
Artificial intelligence. -
電子資源:
https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28148857
ISBN:
9798691216336
Never-Ending Learning of Sounds.
Martinez Elizalde, Benjamin.
Never-Ending Learning of Sounds.
- Ann Arbor : ProQuest Dissertations & Theses, 2020 - 125 p.
Source: Dissertations Abstracts International, Volume: 82-05, Section: B.
Thesis (Ph.D.)--Carnegie Mellon University, 2020.
This item must not be sold to any third party vendors.
Health care, public safety, home security and self-driving cars applications rely on the automatic identification and interpretation of sound events. For example, abnormal respiratory sounds indicate respiratory problems, a gunshot or a glass breaking imply a safe alert, and an ambulance siren wailing implies that vehicles should stop or pull over. Systems that can automatically recognize sound events in order to extract meaning that helps us react accordingly, are systems capable of Sound Understanding. Sound Understanding is an emerging field of Machine Hearing, which aims to build systems that can do sound-related tasks that have nothing to do with hearing - such as sonography, seismic, and sonar - and systems that could hear the way humans do and distinguish between music, speech and sounds~\\cite{lyon2010machine}. Hearing machines that understand sounds like humans do require computational programs that can learn from years of accumulated diverse acoustics. They must use associated knowledge to guide subsequent learning and organize what they hear, learn names for recognizable events, scenes, objects, actions, materials, places, and retrieve sounds by reference to those names. These machines must also continuously improve their hearing competence to encompass all the diversity and scale of the acoustics in the world. Therefore, this thesis proposes the Never-Ending Learner of Sounds (NELS), a computational program that aims to build hearing machines that understand sounds under a never-ending learning paradigm. NELS continuously hears the Web, in order to learn meaningful categories and relationships of sounds, and use this knowledge to index and organize the crawled audio. The content is made available for people to query and recover all kinds of information. To enhance NELS quality of expression of acoustic phenomena, we introduced a new interdisciplinary solution that draws domain knowledge from Psychology to build Machine Learning models. NELS breaks ground in challenges of Sound Understanding, such as collecting datasets with different types of labels and annotation processes, designing and improving sound recognition models, defining knowledge about sounds, and retrieving sounds with different types of similarities.
ISBN: 9798691216336Subjects--Topical Terms:
516317
Artificial intelligence.
Subjects--Index Terms:
Audio signal processing
Never-Ending Learning of Sounds.
LDR
:03430nmm a2200385 4500
001
2281874
005
20210927083417.5
008
220723s2020 ||||||||||||||||| ||eng d
020
$a
9798691216336
035
$a
(MiAaPQ)AAI28148857
035
$a
AAI28148857
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Martinez Elizalde, Benjamin.
$0
(orcid)0000-0003-2697-819X
$3
3560584
245
1 0
$a
Never-Ending Learning of Sounds.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2020
300
$a
125 p.
500
$a
Source: Dissertations Abstracts International, Volume: 82-05, Section: B.
500
$a
Advisor: Lane, Ian;Raj, Bhiksha.
502
$a
Thesis (Ph.D.)--Carnegie Mellon University, 2020.
506
$a
This item must not be sold to any third party vendors.
520
$a
Health care, public safety, home security and self-driving cars applications rely on the automatic identification and interpretation of sound events. For example, abnormal respiratory sounds indicate respiratory problems, a gunshot or a glass breaking imply a safe alert, and an ambulance siren wailing implies that vehicles should stop or pull over. Systems that can automatically recognize sound events in order to extract meaning that helps us react accordingly, are systems capable of Sound Understanding. Sound Understanding is an emerging field of Machine Hearing, which aims to build systems that can do sound-related tasks that have nothing to do with hearing - such as sonography, seismic, and sonar - and systems that could hear the way humans do and distinguish between music, speech and sounds~\\cite{lyon2010machine}. Hearing machines that understand sounds like humans do require computational programs that can learn from years of accumulated diverse acoustics. They must use associated knowledge to guide subsequent learning and organize what they hear, learn names for recognizable events, scenes, objects, actions, materials, places, and retrieve sounds by reference to those names. These machines must also continuously improve their hearing competence to encompass all the diversity and scale of the acoustics in the world. Therefore, this thesis proposes the Never-Ending Learner of Sounds (NELS), a computational program that aims to build hearing machines that understand sounds under a never-ending learning paradigm. NELS continuously hears the Web, in order to learn meaningful categories and relationships of sounds, and use this knowledge to index and organize the crawled audio. The content is made available for people to query and recover all kinds of information. To enhance NELS quality of expression of acoustic phenomena, we introduced a new interdisciplinary solution that draws domain knowledge from Psychology to build Machine Learning models. NELS breaks ground in challenges of Sound Understanding, such as collecting datasets with different types of labels and annotation processes, designing and improving sound recognition models, defining knowledge about sounds, and retrieving sounds with different types of similarities.
590
$a
School code: 0041.
650
4
$a
Artificial intelligence.
$3
516317
650
4
$a
Computer science.
$3
523869
650
4
$a
Acoustics.
$3
879105
653
$a
Audio signal processing
653
$a
DCASE
653
$a
Machine learning
653
$a
Never-ending learning
653
$a
Sound events
653
$a
Sound understanding
690
$a
0800
690
$a
0984
690
$a
0986
710
2
$a
Carnegie Mellon University.
$b
Electrical and Computer Engineering.
$3
2094139
773
0
$t
Dissertations Abstracts International
$g
82-05B.
790
$a
0041
791
$a
Ph.D.
792
$a
2020
793
$a
English
856
4 0
$u
https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28148857
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9433607
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入