Diferență între revizuiri ale paginii „Performance analysis and optimization”

De la WikiLabs
Jump to navigationJump to search
 
(Nu s-au afișat 16 versiuni intermediare efectuate de același utilizator)
Linia 1: Linia 1:
Session 1, x86 optimization:
+
= Results =
  
C/C++: increase the execution speed for the code that reverses the order of the bits composing a 100 M esantioane of unsigned 32-bit (eg. 10111...11 -> 11..11101)
+
https://docs.google.com/spreadsheets/d/1GvZ-P-MEA9iPuBFx2onQ8qg0o2cDQSoWFV2Y0T11Fls/edit?usp=sharing
  
Expected/Presented techniques:
+
= Support materials =
  
first implementation ~ 10 seconds
+
http://www.agner.org/optimize/optimizing_cpp.pdf
  
compiler optimized ~ 5 seconds
+
https://www.arm.com/files/pdf/AT_-_Better_C_Code_for_ARM_Devices.pdf
  
loop unrolling
+
= Lab sessions =
  
bit-tricks
+
Contact: calin.bira_AT_upb.ro
 
 
optimizing variables into registers
 
 
 
256-entry 8-bit table of 1 KB
 
 
 
16-bit table of 64 k entries (256 KB)
 
 
 
32-bit table of 4 G entries (16 GB)
 

Versiunea curentă din 9 martie 2026 08:38