forked from flame/blis
Open
Description
I discovered that compiling the library for target zen4 with gcc version 11.3.1 20221121 (Red Hat 11.3.1-4) (GCC) does not enable AVX-512 acceleration for cblas_cgemm. The performance remains the same as for target zen3. I configured the build with: ./configure CFLAGS="-O3" --enable-cblas --blas-int-size=64 zen4
I observed the same issue in blis-4.0.0. Moreover, performance degraded slightly between versions: my code's run time went up from 146 s (blis-4.0.0) to 151 s (blis-5.0.0).
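To help rule out the machine itself (for example, a VM or container that masks AVX-512), here is a minimal sketch that checks which AVX-512 feature flags the Linux kernel reports. It assumes a Linux host with /proc/cpuinfo; the helper name `avx512_flags_present` and the short flag list are my own choices for illustration, not part of BLIS. It says nothing about which kernels the library actually selected.

```python
from pathlib import Path

# A few AVX-512 feature flags as they appear on the Linux /proc/cpuinfo
# "flags" line. This list is an illustrative subset, not exhaustive.
AVX512_FLAGS = ("avx512f", "avx512dq", "avx512bw", "avx512vl")


def avx512_flags_present(cpuinfo_text):
    """Return the entries of AVX512_FLAGS found on the first 'flags' line."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            flags = set(value.split())
            return [f for f in AVX512_FLAGS if f in flags]
    # No 'flags' line at all (empty input or a non-x86 layout).
    return []


if __name__ == "__main__":
    path = Path("/proc/cpuinfo")
    text = path.read_text() if path.exists() else ""
    found = avx512_flags_present(text)
    print("AVX-512 flags reported by the kernel:", found or "none")
```

If the list comes back empty on the test machine, the identical zen3 and zen4 timings would be expected regardless of how the library was configured.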