I have a __m512d intrinsic vector and I need sum of his elements. Is there any easy way to do this? I am concentrated on a performance of computation, so i need to do this operation quickly. My knowledge about intrinsic is not enough to do it yourself, so I need Your help.
Thanks :)
I guess the following intrinsic should help you:
double _mm512_reduce_add_pd (__m512d a)
P.S. You can look for useful intrinsics in the guide yourself.