Why is the cache hit rate for row major traversal of 16x16 array 75% for both di

ID: 3865619 • Letter: W

Question

Why is the cache hit rate for row major traversal of 16x16 array 75% for both direct mapped and fully associative?

#############################################################################
# Row-major order traversal of 16 x 16 array of words.
#
#
#
# To easily observe the row-oriented order, run the Memory Reference
# Visualization tool with its default settings over this program.
# You may, at the same time or separately, run the Data Cache Simulator
# over this program to observe caching performance. Compare the results
# with those of the column-major order traversal algorithm.
#
# The C/C++/Java-like equivalent of this MIPS program is:
#     int size = 16;
#     int[size][size] data;
#     int value = 0;
#     for (int row = 0; col < size; row++) {
#        for (int col = 0; col < size; col++) }
#           data[row][col] = value;
#           value++;
#        }
#     }
#
# Note: Program is hard-wired for 16 x 16 matrix. If you want to change this,
#        three statements need to be changed.
#        1. The array storage size declaration at "data:" needs to be changed from
#           256 (which is 16 * 16) to #columns * #rows.
#        2. The "li" to initialize $t0 needs to be changed to new #rows.
#        3. The "li" to initialize $t1 needs to be changed to new #columns.
#
         .data
data:    .word     0 : 256       # storage for 16x16 matrix of words
         .text
         li       $t0, 16        # $t0 = number of rows
         li       $t1, 16        # $t1 = number of columns
         move     $s0, $zero     # $s0 = row counter
         move     $s1, $zero     # $s1 = column counter
         move     $t2, $zero     # $t2 = the value to be stored
# Each loop iteration will store incremented $t1 value into next element of matrix.
# Offset is calculated at each iteration. offset = 4 * (row*#cols+col)
# Note: no attempt is made to optimize runtime performance!
loop:    mult     $s0, $t1       # $s2 = row * #cols (two-instruction sequence)
         mflo     $s2            # move multiply result from lo register to $s2
         add      $s2, $s2, $s1 # $s2 += column counter
         sll      $s2, $s2, 2    # $s2 *= 4 (shift left 2 bits) for byte offset
         sw       $t2, data($s2) # store the value in matrix element
         addi     $t2, $t2, 1    # increment value to be stored
# Loop control: If we increment past last column, reset column counter and increment row counter
#                If we increment past last row, we're finished.
         addi     $s1, $s1, 1    # increment column counter
         bne      $s1, $t1, loop # not at end of row so loop back
         move     $s1, $zero     # reset column counter
         addi     $s0, $s0, 1    # increment row counter
         bne      $s0, $t0, loop # not at end of matrix so loop back
# We're finished traversing the matrix.
         li       $v0, 10        # system service 10 is exit
         syscall                 # we are outta here.

Explanation / Answer

Note that A[i ,j] and A[i+1,j] are 16 words apart, and are therefore mapped to different cache lines. The row-first traversal takes a cache miss on every reference. The first 8 references find empty cache lines, the next 8 references evict the first 8, for a total of 256 cache misses.

By contract the column-first traversal moves through memory in sequence, reading each cache line once, for a total of 16 cache misses.

int size = 16;
int[size][size] data;
int value = 0;
for (int row = 0; col < size; row++) {
for (int col = 0; col < size; col++) }
data[row][col] = value; // 1st dim changes fastest
value++;
}

}

So for 256 cache misses if we calculate the cache hit rate for row major traversal of 16x16 array, it is always 75% for both direct mapped and fully associative

Navigate

Why is the bond market so sensitive to interest rates? What is the mathematics b

Why is the cache hit rate for this column_major code 0% for both direct mapped a

Integrity-first tutoring: explanations and feedback only — we do not complete graded work. Learn more.

Why is the cache hit rate for row major traversal of 16x16 array 75% for both di

Question

Explanation / Answer

Related Questions

Navigate