東華大學圖書館 |

Language: English

Help

回圖書館首頁

手機版館藏查詢

Back

Switch To: Labeled | MARC Mode | ISBD

Never-Ending Learning of Sounds.

Martinez Elizalde, Benjamin.

Linked to FindBook

Google Book

Amazon

博客來

Never-Ending Learning of Sounds.

Record Type:	Electronic resources : Monograph/item
Title/Author:	Never-Ending Learning of Sounds./
Author:	Martinez Elizalde, Benjamin.
Published:	Ann Arbor : ProQuest Dissertations & Theses, : 2020,
Description:	125 p.
Notes:	Source: Dissertations Abstracts International, Volume: 82-05, Section: B.
Contained By:	Dissertations Abstracts International82-05B.
Subject:	Artificial intelligence. -
Online resource:	https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28148857
ISBN:	9798691216336

Never-Ending Learning of Sounds.
Martinez Elizalde, Benjamin.

Never-Ending Learning of Sounds. - Ann Arbor : ProQuest Dissertations & Theses, 2020 - 125 p.

Source: Dissertations Abstracts International, Volume: 82-05, Section: B.

Thesis (Ph.D.)--Carnegie Mellon University, 2020.

This item must not be sold to any third party vendors.

Health care, public safety, home security and self-driving cars applications rely on the automatic identification and interpretation of sound events. For example, abnormal respiratory sounds indicate respiratory problems, a gunshot or a glass breaking imply a safe alert, and an ambulance siren wailing implies that vehicles should stop or pull over. Systems that can automatically recognize sound events in order to extract meaning that helps us react accordingly, are systems capable of Sound Understanding. Sound Understanding is an emerging field of Machine Hearing, which aims to build systems that can do sound-related tasks that have nothing to do with hearing - such as sonography, seismic, and sonar - and systems that could hear the way humans do and distinguish between music, speech and sounds~\\cite{lyon2010machine}. Hearing machines that understand sounds like humans do require computational programs that can learn from years of accumulated diverse acoustics. They must use associated knowledge to guide subsequent learning and organize what they hear, learn names for recognizable events, scenes, objects, actions, materials, places, and retrieve sounds by reference to those names. These machines must also continuously improve their hearing competence to encompass all the diversity and scale of the acoustics in the world. Therefore, this thesis proposes the Never-Ending Learner of Sounds (NELS), a computational program that aims to build hearing machines that understand sounds under a never-ending learning paradigm. NELS continuously hears the Web, in order to learn meaningful categories and relationships of sounds, and use this knowledge to index and organize the crawled audio. The content is made available for people to query and recover all kinds of information. To enhance NELS quality of expression of acoustic phenomena, we introduced a new interdisciplinary solution that draws domain knowledge from Psychology to build Machine Learning models. NELS breaks ground in challenges of Sound Understanding, such as collecting datasets with different types of labels and annotation processes, designing and improving sound recognition models, defining knowledge about sounds, and retrieving sounds with different types of similarities.

ISBN: 9798691216336Subjects--Topical Terms:

516317
Artificial intelligence.
Subjects--Index Terms:

Audio signal processing

Never-Ending Learning of Sounds.
LDR:03430nmm a2200385 4500 001 2281874
005 20210927083417.5
008 220723s2020 ||||||||||||||||| ||eng d
020 $a 9798691216336
035 $a (MiAaPQ)AAI28148857
035 $a AAI28148857
040 $a MiAaPQ $c MiAaPQ
100 1 $a Martinez Elizalde, Benjamin. $0 (orcid)0000-0003-2697-819X $3 3560584
245 1 0 $a Never-Ending Learning of Sounds.
260 1 $a Ann Arbor : $b ProQuest Dissertations & Theses, $c 2020
300 $a 125 p.
500 $a Source: Dissertations Abstracts International, Volume: 82-05, Section: B.
500 $a Advisor: Lane, Ian;Raj, Bhiksha.
502 $a Thesis (Ph.D.)--Carnegie Mellon University, 2020.
506 $a This item must not be sold to any third party vendors.
520 $a Health care, public safety, home security and self-driving cars applications rely on the automatic identification and interpretation of sound events. For example, abnormal respiratory sounds indicate respiratory problems, a gunshot or a glass breaking imply a safe alert, and an ambulance siren wailing implies that vehicles should stop or pull over. Systems that can automatically recognize sound events in order to extract meaning that helps us react accordingly, are systems capable of Sound Understanding. Sound Understanding is an emerging field of Machine Hearing, which aims to build systems that can do sound-related tasks that have nothing to do with hearing - such as sonography, seismic, and sonar - and systems that could hear the way humans do and distinguish between music, speech and sounds~\\cite{lyon2010machine}. Hearing machines that understand sounds like humans do require computational programs that can learn from years of accumulated diverse acoustics. They must use associated knowledge to guide subsequent learning and organize what they hear, learn names for recognizable events, scenes, objects, actions, materials, places, and retrieve sounds by reference to those names. These machines must also continuously improve their hearing competence to encompass all the diversity and scale of the acoustics in the world. Therefore, this thesis proposes the Never-Ending Learner of Sounds (NELS), a computational program that aims to build hearing machines that understand sounds under a never-ending learning paradigm. NELS continuously hears the Web, in order to learn meaningful categories and relationships of sounds, and use this knowledge to index and organize the crawled audio. The content is made available for people to query and recover all kinds of information. To enhance NELS quality of expression of acoustic phenomena, we introduced a new interdisciplinary solution that draws domain knowledge from Psychology to build Machine Learning models. NELS breaks ground in challenges of Sound Understanding, such as collecting datasets with different types of labels and annotation processes, designing and improving sound recognition models, defining knowledge about sounds, and retrieving sounds with different types of similarities.
590 $a School code: 0041.
650 4 $a Artificial intelligence. $3 516317
650 4 $a Computer science. $3 523869
650 4 $a Acoustics. $3 879105
653 $a Audio signal processing
653 $a DCASE
653 $a Machine learning
653 $a Never-ending learning
653 $a Sound events
653 $a Sound understanding
690 $a 0800
690 $a 0984
690 $a 0986
710 2 $a Carnegie Mellon University. $b Electrical and Computer Engineering. $3 2094139
773 0 $t Dissertations Abstracts International $g 82-05B.
790 $a 0041
791 $a Ph.D.
792 $a 2020
793 $a English
856 4 0 $u https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28148857