TUTCRIS - Tampereen teknillinen yliopisto

TUTCRIS

Parallel processing intensive digital front-end for IEEE 802.11ac receiver

Tutkimustuotosvertaisarvioitu

Yksityiskohdat

AlkuperäiskieliEnglanti
Otsikko2015 49th Asilomar Conference on Signals, Systems and Computers
KustantajaIEEE
Sivut1619-1626
Sivumäärä8
ISBN (elektroninen)978-1-4673-8576-3
ISBN (painettu)978-1-4673-8574-9
DOI - pysyväislinkit
TilaJulkaistu - 2016
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaAsilomar Conference on Signals, Systems & Computers - California, Pacific Grove, Yhdysvallat
Kesto: 8 marraskuuta 201511 marraskuuta 2015

Julkaisusarja

Nimi
ISSN (elektroninen)1058-6393

Conference

ConferenceAsilomar Conference on Signals, Systems & Computers
MaaYhdysvallat
KaupunkiPacific Grove
Ajanjakso8/11/1511/11/15

Tiivistelmä

Modern computing platforms offer increasing levels of parallelism for fast execution of different signal processing tasks. In this paper, we develop and elaborate on a digital front-end concept for an IEEE 802.11ac receiver with 80 MHz bandwidth where parallel processing is adopted in multiple ways. First, the inherent structure of the 802.11ac waveform is utilized such that it is divided, through time-domain digital filtering and decimation, to two parallel 40 MHz signals that can be processed further in parallel using smaller-size FFTs and, e.g, legacy 802.11n digital receiver chains. This filtering task is very challenging, as the latency and the cyclic prefix budget of the receiver cannot be compromised, and because the number of unused subcarriers in the middle of the 80 MHz signal is only three, thus necessitating very narrow transition bandwidth in the deployed filters. Both linear and circular filtering based multirate channelization architectures are developed and reported, together with the corresponding filter coefficient optimization. Also, full radio link performance simulations with commonly adopted indoor WiFi channel profiles are provided, verifying that the channelization does not degrade the overall link performance. Then, both C and OpenCL software implementations of the processing are developed and simulated for comparison purposes on an Intel CPU, to demonstrate that the parallelism provided by the OpenCL will result in substantially faster realization. Furthermore, we provide complete software implementation results in terms of time, number of clock cycles, power, and energy consumption on the ARM Mali GPU with half precision floating-point arithmetic along with the ARM Cortex A7 CPU.