[Numpy-discussion] Calling C code that assumes SIMD aligned data.
Julian Taylor
jtaylor.debian at googlemail.com
Fri May 6 16:22:38 EDT 2016
Note that alignment larger than 16 bytes is unnecessary for SIMD
purposes on current hardware (>= Haswell). 16 bytes is the default
malloc alignment on amd64.
And even on older hardware (Sandy Bridge) the penalty for unaligned
access is pretty minor.
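
For anyone who wants to check this from Python before handing a
pointer to C, here is a minimal sketch using only the standard
library's ctypes module. The helper names is_aligned and
aligned_buffer are made up for illustration and are not part of numpy
or any other library. The second helper over-allocates and offsets
into the buffer, which is one common way to get a stricter alignment
than the allocator promises; it is not necessarily what numpy does
internally.

```python
import ctypes


def is_aligned(address, alignment=16):
    """Return True if the integer address is a multiple of alignment."""
    return address % alignment == 0


def aligned_buffer(nbytes, alignment=16):
    """Hypothetical helper: return a ctypes char array of nbytes whose
    start address is a multiple of alignment."""
    # Over-allocate so that an aligned start address always fits inside.
    raw = ctypes.create_string_buffer(nbytes + alignment)
    # Number of bytes to skip to reach the next aligned address.
    offset = (-ctypes.addressof(raw)) % alignment
    # from_buffer shares memory with raw and keeps a reference to it.
    return (ctypes.c_char * nbytes).from_buffer(raw, offset)


buf = aligned_buffer(1024, alignment=32)
print(is_aligned(ctypes.addressof(buf), 32))
```

For a numpy array the address to test would be arr.ctypes.data, so
the same check reads is_aligned(arr.ctypes.data, 16).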
On 05.05.2016 22:32, Charles R Harris wrote:
>
>
> On Thu, May 5, 2016 at 2:10 PM, Øystein Schønning-Johansen
> <oysteijo at gmail.com> wrote:
>
> Thanks for your answer, Francesc. Knowing that there is no numpy
> solution saves the work of searching for this. I've not tried the
> solution described at SO, but it looks like a real performance
> killer. I'd rather try to override malloc with glibc's malloc hooks
> or LD_PRELOAD tricks. Do you think that will do it? I'll try it and
> report back.
>
> Thanks,
> -Øystein
>
>
> Might take a look at how numpy handles this in
> `numpy/core/src/umath/simd.inc.src`.
>
> <snip>
>
> Chuck