Philly ETE 2015 #13 – David Richardson – Beating Hand-Tuned Assembly in Compiled Languages
We explain optimization techniques used to set three world speed records. Using a combination of code generation and hardware specific optimizations, we achieved a 20x speedup over hand tuned assembly.