Marvee Amasi Comments - Answer Overflow

Marvee Amasi

•Created by Marvee Amasi on 7/30/2024 in #📦-middleware-and-os

How to optimize SIMD instructions for double precision floating point operations on Intel Core i7

I've compiled the code with GCC using the -O3 optimization flag. While there is some performance improvement compared to the scalar version, it's significantly less than expected. I've measured a speedup of approximately 1.5x on an Intel Core i7 12700K processor. So I'm looking for suggestions on how to further optimize this code for maximum performance. Are there any specific SIMD instructions or techniques that could be beneficial? Thinking of exploring memory optimization strategies like prefetching or prolly cache blocking

2 replies

Gaming

Programming