Marvee Amasi
Marvee Amasi
DIIDevHeads IoT Integration Server
Created by Marvee Amasi on 10/18/2024 in #code-review
Optimizing memcpy Performance on Intel Core i7 10700K: SIMD and Compiler Flags
I am analyzing the performance of memcpy on an Intel Core i7 10700K CPU , using GCC 10.2 on Linux kernel 5.10. My assumption is that its speed should be close to the time it takes to transfer one long multiplied by the number of longs being copied. Could memcpy be optimized to exceed this expectation, possibly using SIMD or other CPU specific features? Are there any compiler flags or hardware optimizations I should be aware of to get the best performance out of memcpy?
6 replies