語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
An Environment for Accessing Remote ...
~
Jadhav, Shrikant Shridhar.
FindBook
Google Book
Amazon
博客來
An Environment for Accessing Remote Field-Programmable Custom Computing Machine Accelerators with Multi-Memory Banks.
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
An Environment for Accessing Remote Field-Programmable Custom Computing Machine Accelerators with Multi-Memory Banks./
作者:
Jadhav, Shrikant Shridhar.
出版者:
Ann Arbor : ProQuest Dissertations & Theses, : 2018,
面頁冊數:
136 p.
附註:
Source: Dissertations Abstracts International, Volume: 79-12, Section: B.
Contained By:
Dissertations Abstracts International79-12B.
標題:
Computer Engineering. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10750895
ISBN:
9780355982046
An Environment for Accessing Remote Field-Programmable Custom Computing Machine Accelerators with Multi-Memory Banks.
Jadhav, Shrikant Shridhar.
An Environment for Accessing Remote Field-Programmable Custom Computing Machine Accelerators with Multi-Memory Banks.
- Ann Arbor : ProQuest Dissertations & Theses, 2018 - 136 p.
Source: Dissertations Abstracts International, Volume: 79-12, Section: B.
Thesis (Ph.D.)--North Carolina Agricultural and Technical State University, 2018.
This item must not be sold to any third party vendors.
Recent research shows that a Field-Programmable Custom Computing Machine (FCCM) can be used to achieve an order-of-magnitude speedup of High-Performance Computing (HPC) applications. An FCCM, within the context of our research, is a host processor connected to one or more Field Programmable Gate Arrays (FPGAs). Typically, for some applications, FCCMs provide better performance than conventional processors and consume less energy than Graphics Processing Units (GPUs). FCCMs can also be integrated into cloud-computing environments and data centers to enhance the accessibility of these hardware accelerators. FCCMs are a viable choice for accelerating applications in various areas including bio-informatics, neuroinformatics, physics, computer vision, etc. where significant processing capability is required.While extensive research is underway to accelerate computationally intensive tasks in HPC applications on FCCMs, few studies report frameworks and multi-memory architectures developed to provide access to FCCMs from a remote site. The remote environment introduced in this thesis, provided to access FCCM accelerators, proves beneficial to users with limited hardware/software knowledge. The remote environment allows users to access FCCM accelerators remotely from any part of the world. The remote environment also helps to reduce overall system cost since the users are not required to invest in an expensive FCCM design and infrastructure development.Memory bandwidth is a major factor in developing FCCMs with execution times faster than traditional processors. For achieving high performance, FCCM processing elements must be able to consume/produce data at the highest possible data rates. If there are multiple memory banks, the number of memory accesses required to bring all data into the FCCM processing unit can be significantly reduced. Similarly, outputs can be transmitted in few numbers of memory write operations. In turn, increasing memory bandwidth will positively impact the overall performance of the FCCM accelerator. However, adding multiple memories adds extra hardware to the FCCM, increasing size/complexity, as an interface is needed for each additional memory.In this thesis, we present a Java-based environment to access FCCM accelerators from a remote site. We have developed a hardware/software interface which supports multiple FPGAs connected to a server. This remote environment allows users with limited knowledge of the underlying hardware/software to take full advantage of FCCM accelerators by accessing FCCMs in a remote server from a client machine. We also provide a multi-memory architecture for accepting multiple inputs and producing multiple outputs per clock cycle. The architecture includes processor cores with pipelined functional units specifically tailored for each application. Additionally, we present an approach to achieve an order-of-magnitude speedup over a traditional software implementation executing on a conventional multi-core processor. Even though the clock frequency of the FCCM is an order of magnitude slower than a conventional multi-core processor, the FCCM is significantly faster.This thesis also presents a case study using the Taylor Series to demonstrate the merits of the local FCCM. The Taylor Series is a computationally intensive application used in many HPC applications, e.g., computer vision [10], cellular networks [53] and deep neural network [26]. In our experiments, we executed the Taylor Series in software and compared execution times with an FCCM interfaced to the same machine. Our experiments show that the results obtained using our multi-memory architecture is approximately 481X faster than software executing the Taylor Series on a typical server. We also implemented other computational primitives, i.e., Power, Natural Logarithm and Exponential on a local FCCM and compared their execution time with their respective software implementation. The experiments show that, with our multi-memory architecture, we achieved 1-2 orders-of-magnitude speedup over software implementation.This thesis also presents results that demonstrate the merits of a remote FCCM with the multi-memory architecture. In our experiments, we executed the FCCM implementation of the same applications on the remote server machine and transmitted data to be processed from a client machine. Our experiments show that the execution time for the remote FCCM is not constrained by the specialized processor, but rather by the network bandwidth. On a network with 300 Mbps upload speed, and 900 Mbps download speed of the Internet, execution of an application on the remote FCCM with our remote framework, with two clients and two servers, can achieve 10X speedup over the execution of a software implementation of the same application on the local machine.
ISBN: 9780355982046Subjects--Topical Terms:
1567821
Computer Engineering.
An Environment for Accessing Remote Field-Programmable Custom Computing Machine Accelerators with Multi-Memory Banks.
LDR
:05996nmm a2200325 4500
001
2208072
005
20190929184211.5
008
201008s2018 ||||||||||||||||| ||eng d
020
$a
9780355982046
035
$a
(MiAaPQ)AAI10750895
035
$a
(MiAaPQ)ncat:10824
035
$a
AAI10750895
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Jadhav, Shrikant Shridhar.
$3
3435084
245
1 3
$a
An Environment for Accessing Remote Field-Programmable Custom Computing Machine Accelerators with Multi-Memory Banks.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2018
300
$a
136 p.
500
$a
Source: Dissertations Abstracts International, Volume: 79-12, Section: B.
500
$a
Publisher info.: Dissertation/Thesis.
500
$a
Advisor: Doss, Christopher;Gloster, Clay.
502
$a
Thesis (Ph.D.)--North Carolina Agricultural and Technical State University, 2018.
506
$a
This item must not be sold to any third party vendors.
520
$a
Recent research shows that a Field-Programmable Custom Computing Machine (FCCM) can be used to achieve an order-of-magnitude speedup of High-Performance Computing (HPC) applications. An FCCM, within the context of our research, is a host processor connected to one or more Field Programmable Gate Arrays (FPGAs). Typically, for some applications, FCCMs provide better performance than conventional processors and consume less energy than Graphics Processing Units (GPUs). FCCMs can also be integrated into cloud-computing environments and data centers to enhance the accessibility of these hardware accelerators. FCCMs are a viable choice for accelerating applications in various areas including bio-informatics, neuroinformatics, physics, computer vision, etc. where significant processing capability is required.While extensive research is underway to accelerate computationally intensive tasks in HPC applications on FCCMs, few studies report frameworks and multi-memory architectures developed to provide access to FCCMs from a remote site. The remote environment introduced in this thesis, provided to access FCCM accelerators, proves beneficial to users with limited hardware/software knowledge. The remote environment allows users to access FCCM accelerators remotely from any part of the world. The remote environment also helps to reduce overall system cost since the users are not required to invest in an expensive FCCM design and infrastructure development.Memory bandwidth is a major factor in developing FCCMs with execution times faster than traditional processors. For achieving high performance, FCCM processing elements must be able to consume/produce data at the highest possible data rates. If there are multiple memory banks, the number of memory accesses required to bring all data into the FCCM processing unit can be significantly reduced. Similarly, outputs can be transmitted in few numbers of memory write operations. In turn, increasing memory bandwidth will positively impact the overall performance of the FCCM accelerator. However, adding multiple memories adds extra hardware to the FCCM, increasing size/complexity, as an interface is needed for each additional memory.In this thesis, we present a Java-based environment to access FCCM accelerators from a remote site. We have developed a hardware/software interface which supports multiple FPGAs connected to a server. This remote environment allows users with limited knowledge of the underlying hardware/software to take full advantage of FCCM accelerators by accessing FCCMs in a remote server from a client machine. We also provide a multi-memory architecture for accepting multiple inputs and producing multiple outputs per clock cycle. The architecture includes processor cores with pipelined functional units specifically tailored for each application. Additionally, we present an approach to achieve an order-of-magnitude speedup over a traditional software implementation executing on a conventional multi-core processor. Even though the clock frequency of the FCCM is an order of magnitude slower than a conventional multi-core processor, the FCCM is significantly faster.This thesis also presents a case study using the Taylor Series to demonstrate the merits of the local FCCM. The Taylor Series is a computationally intensive application used in many HPC applications, e.g., computer vision [10], cellular networks [53] and deep neural network [26]. In our experiments, we executed the Taylor Series in software and compared execution times with an FCCM interfaced to the same machine. Our experiments show that the results obtained using our multi-memory architecture is approximately 481X faster than software executing the Taylor Series on a typical server. We also implemented other computational primitives, i.e., Power, Natural Logarithm and Exponential on a local FCCM and compared their execution time with their respective software implementation. The experiments show that, with our multi-memory architecture, we achieved 1-2 orders-of-magnitude speedup over software implementation.This thesis also presents results that demonstrate the merits of a remote FCCM with the multi-memory architecture. In our experiments, we executed the FCCM implementation of the same applications on the remote server machine and transmitted data to be processed from a client machine. Our experiments show that the execution time for the remote FCCM is not constrained by the specialized processor, but rather by the network bandwidth. On a network with 300 Mbps upload speed, and 900 Mbps download speed of the Internet, execution of an application on the remote FCCM with our remote framework, with two clients and two servers, can achieve 10X speedup over the execution of a software implementation of the same application on the local machine.
590
$a
School code: 1544.
650
4
$a
Computer Engineering.
$3
1567821
650
4
$a
Computer science.
$3
523869
690
$a
0464
690
$a
0984
710
2
$a
North Carolina Agricultural and Technical State University.
$b
Electrical Engineering.
$3
3181344
773
0
$t
Dissertations Abstracts International
$g
79-12B.
790
$a
1544
791
$a
Ph.D.
792
$a
2018
793
$a
English
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10750895
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9384621
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入