語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Design and implementation of tool-ch...
~
University of Delaware., Department of Electrical and Computer Engineering.
FindBook
Google Book
Amazon
博客來
Design and implementation of tool-chain framework to support OpenMP single source compilation on cell platform.
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
Design and implementation of tool-chain framework to support OpenMP single source compilation on cell platform./
作者:
Jiang, Yi.
面頁冊數:
83 p.
附註:
Adviser: Guang R. Gao.
Contained By:
Masters Abstracts International46-04.
標題:
Engineering, Electronics and Electrical. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=1450066
ISBN:
9780549386957
Design and implementation of tool-chain framework to support OpenMP single source compilation on cell platform.
Jiang, Yi.
Design and implementation of tool-chain framework to support OpenMP single source compilation on cell platform.
- 83 p.
Adviser: Guang R. Gao.
Thesis (M.S.)--University of Delaware, 2008.
The well-known performance and power bottlenecks in traditional architecture design, in conjunction with the sustained demand for high performance in real world applications, stimulated the creation of new designs that utilize multi-cores in one processor. There are two approaches in multi-core design: homogeneous and heterogeneous. The homogeneous design is based on the replication of simple cores. It is easier for developers to port existing applications to this kind of platform. However, great diversity exists among applications and a homogeneous multi-core chip cannot be optimal for heterogeneous workloads. Therefore, more and more multi-core designs tend to utilize heterogeneous cores and specialized accelerators. The Cell Broadband Engine (CBE) is a representative. Every Cell processor integrates one Power Processing Engine(PPE) core and eight Synergistic Processing Engines (SPE). A PPE core has a traditional memory and cache hierarchy and it accesses memory via caching mechanism. On the other hand, each SPE core only has 256K local storage and accesses its own local storage directly. All data exchange between the SPUs and shared system memory is via high-latency DMA operation.
ISBN: 9780549386957Subjects--Topical Terms:
626636
Engineering, Electronics and Electrical.
Design and implementation of tool-chain framework to support OpenMP single source compilation on cell platform.
LDR
:04691nmm 2200313 a 45
001
891333
005
20101111
008
101111s2008 ||||||||||||||||| ||eng d
020
$a
9780549386957
035
$a
(UMI)AAI1450066
035
$a
AAI1450066
040
$a
UMI
$c
UMI
100
1
$a
Jiang, Yi.
$3
1065328
245
1 0
$a
Design and implementation of tool-chain framework to support OpenMP single source compilation on cell platform.
300
$a
83 p.
500
$a
Adviser: Guang R. Gao.
500
$a
Source: Masters Abstracts International, Volume: 46-04, page: 2226.
502
$a
Thesis (M.S.)--University of Delaware, 2008.
520
$a
The well-known performance and power bottlenecks in traditional architecture design, in conjunction with the sustained demand for high performance in real world applications, stimulated the creation of new designs that utilize multi-cores in one processor. There are two approaches in multi-core design: homogeneous and heterogeneous. The homogeneous design is based on the replication of simple cores. It is easier for developers to port existing applications to this kind of platform. However, great diversity exists among applications and a homogeneous multi-core chip cannot be optimal for heterogeneous workloads. Therefore, more and more multi-core designs tend to utilize heterogeneous cores and specialized accelerators. The Cell Broadband Engine (CBE) is a representative. Every Cell processor integrates one Power Processing Engine(PPE) core and eight Synergistic Processing Engines (SPE). A PPE core has a traditional memory and cache hierarchy and it accesses memory via caching mechanism. On the other hand, each SPE core only has 256K local storage and accesses its own local storage directly. All data exchange between the SPUs and shared system memory is via high-latency DMA operation.
520
$a
Therefore, this architecture presents great challenges to programmers who want to utilize parallelism: (1) Threads running on PPE are far different from the ones on SPEs, both in capability and ISAs. The users have to take care of programming threads in each ISA, as well as their cooperation and synchronization. (2) The SPE local storage is so limited that SPE code or data may have to be partitioned into overlay sections. Given the explicit memory hierarchy, it is necessary for the programmers to issue DMA instructions at the appropriate time and transfer them to or from system memory. (3) Shared data in SPE code also needs to be loaded from and stored to main memory. The Cell processor can guarantee the coherence for all DMA transactions. However, the user would have the responsibility to keep data coherence among SPE local storage.
520
$a
This thesis is inspired by the Open Opell project whose goal is to develop the GNU-Based OpenMP on the CBE platform and address the challenges mentioned above. Our solution involves the whole tool chain and a runtime system. The CBE programmer can take advantage of the convenience of this abstract share-memory programming model. Specifically, we design and implement single source compilation to attack the first challenge. Then the "ghost" mechanism, which invokes overlays and partition manager libraries, helps to deal with the second challenge. Lastly, we try to tackle the third challenge by the software cache library.
520
$a
In this thesis, we mainly focus on my work in the assembler and linker part of the Open Opell system, which addresses three specific problems: (1) Given a heterogeneous system like CBE, how to support single source compilation? (2) How to design a mechanism to automatically invoke code/data overlays and hide all details to the end-user? (3) How to make the solution and implementation robust and efficient?
520
$a
The main contributions of this thesis are: (1) We have completed the design of interfaces between different phases in toolchain to support single openMP source program compilation. (2) We have designed the "ghost" mechanism to invoke the overlays and hide the details about the overlay to high-level users. It will work with the partition manager run-time library to complete automatic code/data load if needed. (3) We have implemented the proposed design in our Open Opell system in a robust and efficient manner. We will focus on the assembler and linker implementation in this thesis. We have used a series of experiments with several non-trivial benchmarks to prove our claim.
590
$a
School code: 0060.
650
4
$a
Engineering, Electronics and Electrical.
$3
626636
690
$a
0544
710
2
$a
University of Delaware.
$b
Department of Electrical and Computer Engineering.
$3
1020805
773
0
$t
Masters Abstracts International
$g
46-04.
790
$a
0060
790
1 0
$a
Gao, Guang R.,
$e
advisor
791
$a
M.S.
792
$a
2008
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=1450066
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9083461
電子資源
11.線上閱覽_V
電子書
EB W9083461
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入