TUTCRIS - Tampereen teknillinen yliopisto

TUTCRIS

Design and implementation of parallel memory architectures

Tutkimustuotos

Yksityiskohdat

AlkuperäiskieliEnglanti
JulkaisupaikkaTampere
KustantajaTampere University of Technology
Sivumäärä65
ISBN (elektroninen)952-15-1726-3
ISBN (painettu)952-15-1689-5
TilaJulkaistu - 15 joulukuuta 2006
OKM-julkaisutyyppiG5 Artikkeliväitöskirja

Julkaisusarja

NimiTampere University of Technology. Publication
KustantajaTampere University of Technology
Vuosikerta641
ISSN (painettu)1459-2045

Tiivistelmä

Parallel processing is continually concerned about how to supply all the processing nodes with data. Many of the applications favor special data patterns that could be accessed in parallel. This is utilized in parallel memories, where the idea is to increase memory bandwidth with several memory modules working in parallel and feed the processor with only necessary data. Traditional parallel memories are application specific and support only fixed data access requirements. In this Thesis, memory flexibility is increased to give support for several algorithms by adding run-time configurability to parallel memories. Multitude of data access templates and module assignment functions can be used within a single hardware implementation, which has not been possible in prior embedded parallel memory systems. The design reusability of the memories is also improved since the same memory system is applicable in several separate implementations. Three novel parallel memory architectures are presented in this Thesis: one traditional application specific type and two with run-time configurability. The results show that run-time configurability can be included in parallel memories with a reasonable cost. As a case study with four memory modules, the normalized complexity of the proposed configurable parallel memories is 63 80% less than the conventional type of parallel memory. Moreover, in configurable parallel memories, the complexity increase in permutation networks is expressed to become the most critical when increasing the memory module count. According to evaluations, up to 79% of the total parallel memory gate count is consumed by the permutation networks excluding memory cells. The results of this Thesis can be used for designing a flexible memory system for real-time multimedia applications that demand high data throughput and data parallel computation.

Julkaisufoorumi-taso

Latausten tilastot

Ei tietoja saatavilla