08b3e08a91
LAPACK 3.6.1: What’s new

[Mark Gates, UTK] Blocked back-transformation for the non-symmetric eigenvalue problem.

It blocks NB gemv calls into one gemm call inside trevc. To do that, it needs a new routine, trevc3, because unfortunately the lwork was not passed into trevc. Attached is the performance speedup for dgeev. It gives a nice 1.5x speedup for N=20000, and that appears to still be increasing with N.

This does not include the improvements that Greg Henry recently provided for doing the triangular solves as BLAS-3 instead of BLAS-1. Those will take a while to process, but we expect another, even larger increase in performance when those changes are applied.

This also does not include doing multiple (BLAS-1) triangular solves in parallel, which is available in MAGMA, since that requires OpenMP or pthreads.
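The blocking idea in the note above can be sketched in matrix notation. This is an illustration of the general technique, not the exact trevc3 algorithm: it ignores the triangular structure of the vectors. The symbols Q (the Schur vectors), x_j (eigenvectors of the triangular Schur form), v_j, X and V are notation assumed here and do not come from the release note.

```latex
% Unblocked: one matrix-vector product (gemv) per eigenvector
v_j = Q\,x_j, \qquad j = 1, \dots, \mathrm{NB}

% Blocked: the same NB products collected into a single matrix-matrix product (gemm)
V = Q\,X, \qquad
X = [\,x_1 \;\cdots\; x_{\mathrm{NB}}\,], \quad
V = [\,v_1 \;\cdots\; v_{\mathrm{NB}}\,]
```

Holding the block X requires extra workspace, which is consistent with the note's remark that a new routine taking lwork was needed.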
27 lines
960 B
Makefile
# $NetBSD: Makefile,v 1.38 2016/10/18 14:08:14 wiz Exp $

.include "Makefile.common"

COMMENT= Linear Algebra PACKage

BUILD_TARGET= lapacklib

#FFLAGS+= -ffloat-store

# With f2c-20090411nb4, libf2c-20090201nb3, we need EXT_ETIME but gcc44-4.4.1nb3
# needs INT_ETIME. Rather than trying to enumerate all possible fortran compilers,
# just do a test for it.
do-configure:
	@${ECHO_N} "Checking if fortran compiler has etime() as an intrinsic: "
	@cd ${WRKDIR} && ${FC} -o etime_test ${FILESDIR}/etime_test.f >${WRKDIR}/etime_test.log 2>&1; \
	if test $$? -eq 0 ; then \
		${ECHO} "TIMER=INT_ETIME" >> ${WRKSRC}/make.inc ; ${ECHO} "YES" ; else \
		${ECHO} "TIMER=EXT_ETIME" >> ${WRKSRC}/make.inc ; ${ECHO} "NO" ; fi
	@${ECHO} "See ${WRKDIR}/etime_test.log for details on the test"

do-install:
	${LIBTOOL} --mode=install ${INSTALL_DATA} ${WRKSRC}/liblapack.la \
		${DESTDIR}${PREFIX}/lib

.include "../../math/blas/buildlink3.mk"
.include "../../mk/bsd.pkg.mk"
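The do-configure target above picks the timer setting by trying to compile a small probe and checking the compiler's exit status. A minimal sketch of the same logic as a standalone shell function might look like the following. The function name `probe_timer` and its three arguments are hypothetical, introduced only for illustration; the Makefile itself uses `${FC}`, `${FILESDIR}/etime_test.f` and `${WRKDIR}/etime_test.log` directly.

```shell
#!/bin/sh
# Sketch only: probe_timer and its arguments are hypothetical names,
# not part of pkgsrc or of the Makefile above.
#
# Usage: probe_timer <compiler> <source-file> <log-file>
# Prints TIMER=INT_ETIME if the probe compiles, TIMER=EXT_ETIME otherwise.
probe_timer() {
    fc=$1
    src=$2
    log=$3
    # Try to compile the probe; all compiler output goes to the log file,
    # so only the exit status decides the result.
    if "$fc" -o etime_test "$src" >"$log" 2>&1; then
        echo "TIMER=INT_ETIME"
    else
        echo "TIMER=EXT_ETIME"
    fi
}
```

In the Makefile the exit status is written as `$$?` because make expands `$$` to a literal `$` before passing the line to the shell. The printed line would be appended to `make.inc`, as the recipe does with `>> ${WRKSRC}/make.inc`.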