Note on Stride:
Memory stride is the distance between memory accesses and is measured as:
Local stride: This is the memory stride between two memory accesses for the same memory reference.
Global stride: This is the memory stride between memory accesses for consecutive memory references.
Consider the following example:
for (i=0; i<1000; i++){ for(j=0; j<10; j++){ sum += arrayOne[i] + arrayTwo[j]; } result[i] = sum; }The memory stride between consecutive memory accesses for the arrayOne memory reference is its local stride, i.e. (starting address of arrayOne[50] - ending address of arrayOne[49] ) is the local stride of arrayOne.
Note: This also means that if a region of code contains only 1 memory reference, global stride will be the same as local stride.
gStride7:
gStride7 or global stride 7 is a measure of the number of consecutive memory references that access the same 8 Byte memory location (through a double or int64_t data type).
Classifying gStride7:
gStride7 can be classified as low, medium or high as follows:
Bucket | Condition |
---|---|
Low |
The region of code contains no memory references.
OR
The region of code contains a number of memory references and no combinations of the memory references have a global stride of 7.
|
Medium | The region of code contains a number of memory references and the references that have a global stride of 7 are outnumbered by about 12:1. |
High | The region of code contains a number of memory references and the references that have a global stride of 7 are outnumbered by 10:1 or more. |
Example:
Consider the following code:
for (i=0; i<1000; i++){ for(j=0; j<10; j++){ sum += arrayOne[i] * arrayTwo[j]; } result[i] = sum; }This code contains 3 memory references (through arrays arrayOne, arrayTwo & result) and none of these references access successive the same 8 Byte memory location as arrayOne is at least (1000*8) 8kB wide.