Question

Consider a GPU with the following characteristics:

Clock rate: 1.6 GHz
Number of SIMD processors (i.e. Streaming Multiprocessors (SMs) in NVIDIA CUDA terminology): 16
Number of floating-point units per SIMD processor: 16
GPU off-chip memory bandwidth: 100 GB/s

Compute the throughput in FLoating-point Operations Per Second (FLOPS) without considering the memory bandwidth and assuming all memory latencies can be hidden. Assuming that each FP operation requires two operands of 4 bytes each and outputs one 4-byte result, is this throughput sustainable with the current memory bandwidth?

Explanation / Answer

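Peak FLOPS = clock rate × number of SIMD processors × FP units per SIMD processor, assuming each FP unit completes one operation per cycle.

Peak FLOPS = 1.6 GHz × 16 × 16 = 409.6 GFLOPS

So the peak throughput is 409.6 GFLOPS.

For the memory check, each FP operation moves 2 × 4 bytes of operands in and 4 bytes of result out, i.e. 12 bytes per operation. Sustaining the peak rate would need 409.6 × 10^9 × 12 bytes/s = 4915.2 GB/s, about 4.9 TB/s, which is roughly 49 times the available 100 GB/s.

So no, this throughput is not sustainable if every operand comes from off-chip memory. At 100 GB/s, memory can feed at most 100 / 12 ≈ 8.3 GFLOPS.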
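The arithmetic can be checked with a short script. This is only a sketch using the numbers given in the question. The function and variable names are made up for illustration, and it assumes each FP unit completes one operation per cycle and that every operand and result travels through off-chip memory (no reuse from registers or on-chip memory).

```python
# Figures taken from the question.
CLOCK_HZ = 1.6e9             # 1.6 GHz
NUM_SM = 16                  # SIMD processors (SMs)
FPUS_PER_SM = 16             # floating-point units per SM
MEM_BW_BYTES_PER_S = 100e9   # 100 GB/s off-chip bandwidth
BYTES_PER_OP = 2 * 4 + 4     # two 4-byte operands in, one 4-byte result out


def peak_flops(clock_hz, num_sm, fpus_per_sm, ops_per_fpu_per_cycle=1):
    """Peak FP operations per second, ignoring memory entirely.

    ops_per_fpu_per_cycle=1 is an assumption; the question does not
    say whether a unit can issue more than one operation per cycle.
    """
    return clock_hz * num_sm * fpus_per_sm * ops_per_fpu_per_cycle


def required_bandwidth(flops, bytes_per_op):
    """Bytes per second needed if every operation goes to off-chip memory."""
    return flops * bytes_per_op


peak = peak_flops(CLOCK_HZ, NUM_SM, FPUS_PER_SM)
needed = required_bandwidth(peak, BYTES_PER_OP)
memory_bound_flops = MEM_BW_BYTES_PER_S / BYTES_PER_OP
sustainable = needed <= MEM_BW_BYTES_PER_S

print(f"Peak throughput:       {peak / 1e9:.1f} GFLOPS")
print(f"Bandwidth needed:      {needed / 1e9:.1f} GB/s")
print(f"Bandwidth available:   {MEM_BW_BYTES_PER_S / 1e9:.1f} GB/s")
print(f"Memory-bound ceiling:  {memory_bound_flops / 1e9:.2f} GFLOPS")
print(f"Sustainable from memory alone: {sustainable}")
```

Changing `ops_per_fpu_per_cycle` to 2 would model units that count a fused multiply-add as two operations, which doubles both the peak and the bandwidth needed.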