Diferență între revizuiri ale paginii „PC Lab 4”

De la WikiLabs
Jump to navigationJump to search
 
Linia 23: Linia 23:
  
 
10 for better than 75 & 150 MD/s
 
10 for better than 75 & 150 MD/s
 
[[Fișier:Callgrind.out.20485.zip]]
 

Versiunea curentă din 19 aprilie 2018 16:03

Session 4

Speed optimization over i5/i7 x64 arch:

Compute distance between two vectors of points in 128-D space (128 coordinates). The purpose is to find the maximum distance from any one point.

As a distance, implement L1 (SAD) and L2 (SSD) norm over the 8-bit unsigned data type

Lnorm.png

Points (out of 10) vs. expected performance (SSD & SAD):

5 for better than 5 & 10 MD/s

6 for better than 10 & 20 MD/s

7 for better than 20 & 40 MD/s

8 for better than 35 & 70 MD/s

9 for better than 50 & 100 MD/s

10 for better than 75 & 150 MD/s