Question
Consider a GPU with the following characteristics:

- Clock rate: 1.6 GHz
- Number of SIMD processors (i.e. Streaming Multiprocessors (SM) in NVIDIA CUDA terminology): 16
- Number of Floating Point Units per SIMD processor: 16
- GPU off-chip memory bandwidth: 100 GB/s

Compute the throughput in FLoating-point Operations Per Second (FLOPS) without considering the memory bandwidth and assuming all memory latencies can be hidden. Assuming that each FP operation requires two operands of 4 Bytes each and outputs one 4 Byte result, is this throughput sustainable with current memory bandwidth?
Explanation / Answer
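Peak throughput is the number of FP operations the GPU can complete per second. Assuming each floating-point unit completes one operation per clock cycle:

FLOPS = clock rate * number of SIMD processors * FPUs per SIMD processor
      = 1.6 GHz * 16 * 16
      = 409.6 GFLOPS

Each FP operation moves 3 * 4 = 12 bytes (two 4-byte operands in, one 4-byte result out), so sustaining that rate from off-chip memory would require

409.6 GFLOPS * 12 bytes per operation = 4915.2 GB/s (about 4.9 TB/s)

That is roughly 49 times the available 100 GB/s, so the throughput is not sustainable if every operand is read from, and every result written to, off-chip memory. At 100 GB/s the memory system can feed only about 100 / 12 = 8.3 GFLOPS. Getting closer to the peak requires reusing operands held in registers, caches, or other on-chip memory.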
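The arithmetic for this problem can be checked with a short Python sketch. It assumes each floating-point unit completes one FP operation per clock cycle, which the question does not state explicitly, and that every operand and result travels over the off-chip memory bus. The function and constant names are illustrative only.

```python
# Peak throughput and memory-bandwidth check for the GPU in the question.
# Assumption (not stated in the question): each FPU completes one
# floating-point operation per clock cycle.

CLOCK_HZ = 1.6e9                  # 1.6 GHz
NUM_SIMD_PROCESSORS = 16          # SMs in CUDA terminology
FPUS_PER_PROCESSOR = 16
MEM_BANDWIDTH_BYTES_PER_S = 100e9 # 100 GB/s
BYTES_PER_OP = 3 * 4              # two 4-byte operands in, one 4-byte result out


def peak_flops(clock_hz, processors, fpus_per_processor, ops_per_cycle=1):
    """Peak FP operations per second, ignoring memory entirely."""
    return clock_hz * processors * fpus_per_processor * ops_per_cycle


def required_bandwidth(flops, bytes_per_op):
    """Bytes per second needed if every operand and result uses off-chip memory."""
    return flops * bytes_per_op


def bandwidth_limited_flops(bandwidth_bytes_per_s, bytes_per_op):
    """FP operations per second that the memory system alone can feed."""
    return bandwidth_bytes_per_s / bytes_per_op


peak = peak_flops(CLOCK_HZ, NUM_SIMD_PROCESSORS, FPUS_PER_PROCESSOR)
needed = required_bandwidth(peak, BYTES_PER_OP)
sustained = bandwidth_limited_flops(MEM_BANDWIDTH_BYTES_PER_S, BYTES_PER_OP)

print(f"Peak throughput:         {peak / 1e9:.1f} GFLOPS")
print(f"Bandwidth required:      {needed / 1e9:.1f} GB/s")
print(f"Bandwidth available:     {MEM_BANDWIDTH_BYTES_PER_S / 1e9:.1f} GB/s")
print(f"Memory-bound throughput: {sustained / 1e9:.2f} GFLOPS")
print(f"Sustainable at peak:     {needed <= MEM_BANDWIDTH_BYTES_PER_S}")
```

Changing `ops_per_cycle` to 2 would model hardware that counts a fused multiply-add as two operations. That doubles the peak and makes the bandwidth gap larger still.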