Diferență între revizuiri ale paginii „PC Lab 4”
De la WikiLabs
Jump to navigationJump to searchCbira (discuție | contribuții) |
Cbira (discuție | contribuții) |
||
Linia 24: | Linia 24: | ||
10 for better than 75 & 150 MD/s | 10 for better than 75 & 150 MD/s | ||
− | http://wiki.dcae.pub.ro/images/6/62/Callgrind.out.20485.zip | + | [[Callgrind Log] http://wiki.dcae.pub.ro/images/6/62/Callgrind.out.20485.zip] |
Versiunea de la data 19 aprilie 2018 14:53
Session 4
Speed optimization over i5/i7 x64 arch:
Compute distance between two vectors of points in 128-D space (128 coordinates). The purpose is to find the maximum distance from any one point.
As a distance, implement L1 (SAD) and L2 (SSD) norm over the 8-bit unsigned data type
Points (out of 10) vs. expected performance (SSD & SAD):
5 for better than 5 & 10 MD/s
6 for better than 10 & 20 MD/s
7 for better than 20 & 40 MD/s
8 for better than 35 & 70 MD/s
9 for better than 50 & 100 MD/s
10 for better than 75 & 150 MD/s
[[Callgrind Log] http://wiki.dcae.pub.ro/images/6/62/Callgrind.out.20485.zip]