Skip to main navigation Skip to search Skip to main content

Designing coalescing network-on-chip for efficient memory accesses of GPGPUs

  • Chien-Ting Chen
  • , Yoshi Shih-Chieh Huang
  • , Yuan-Ying Chang
  • , Chiao-Yun Tu
  • , Chung-Ta King
  • , Tai-Yuan Wang
  • , Janche Sang
  • , Ming-Hua Li
  • National Tsing Hua University
  • Cleveland State University
  • Industrial Technology Research Institute of Taiwan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

The massive multithreading architecture of General Purpose Graphic Processors Units (GPGPU) makes them ideal for data parallel computing. However, designing efficient GPGPU chips poses many challenges. One major hurdle is the interface to the external DRAM, particularly the buffers in the memory controllers (MCs), which is stressed heavily by the many concurrent memory accesses from the GPGPU. Previous approaches considered scheduling the memory requests in the memory buffers to reduce switching of memory rows. The problem is that the window of requests that can be considered for scheduling is too narrow and the memory controller is very complex, affecting the critical path. In view of the massive multithreading architecture of GPGPUs that can hide memory access latencies, we exploit in this paper the novel idea of rearranging the memory requests in the network-on-chip (NoC), called packet coalescing. To study the feasibility of this idea, we have designed an expanded NoC router that supports packet coalescing and evaluated its performance extensively. Evaluation results show that this NoC-assisted design strategy can improve the row buffer hit rate in the memory controllers. A comprehensive investigation of factors affecting the performance of coalescing is also conducted and reported. © 2014 IFIP International Federation for Information Processing.
Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Place of Publicationdeu
PublisherSpringer [email protected]
Pages169-180
Number of pages12
Volume8707 LNCS
ISBN (Print)9783662449165
DOIs
StatePublished - Jan 1 2014
Event11th IFIP WG 10.3 International Conference on Network and Parallel Computing, NPC 2014 - Ilan, Taiwan, Province of China
Duration: Sep 18 2014Sep 20 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer [email protected]
Volume8707 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Conference

Conference11th IFIP WG 10.3 International Conference on Network and Parallel Computing, NPC 2014
Country/TerritoryTaiwan, Province of China
CityIlan
Period09/18/1409/20/14

Keywords

  • general-purpose graphic processors unit
  • latency hiding
  • memory controller
  • Network-on-chip
  • router design

Cite this