Version 0.41.0
This release adds the following major features:
* Diagnostics showing the optimizations done by ParallelAccelerator
* Support for profiling Numba-compiled functions in Intel VTune
* Additional NumPy functions: partition, nancumsum, nancumprod, ediff1d, cov,
conj, conjugate, tri, tril, triu
* Initial support for Python 3 Unicode strings
General Enhancements:
* armv7 support
* invert mapping b/w binop operators and the operator module
* First attempt at parallel diagnostics
* Adding NUMBA_ENABLE_PROFILING envvar, enabling jit event
* Support for np.partition
* Support for np.nancumsum and np.nancumprod
* Add location information to exceptions.
* Support for np.ediff1d
* Support for np.cov
* Support user pipeline class in with lifting
* string support
* Improve error message for empty imprecise lists.
* Enable overload(operator.getitem)
* Support negative indexing in tuple.
* Refactor Const type
* Optimized usage of alloca out of the loop
* Updates for llvmlite 0.26
* Add support for `np.conj/np.conjugate`.
* np.tri, np.tril, np.triu - default optional args
* Permit dtype argument as sole kwarg in np.eye
CUDA Enhancements:
* Add max_registers Option to cuda.jit
Continuous Integration / Testing:
* CI with Azure Pipelines
* Workaround race condition with apt
* Fix issues with Azure Pipelines
* Fix `RuntimeWarning: 'numba.runtests' found in sys.modules`
* Disable openmp in wheel building
* Azure Pipelines templates
* Fix cuda tests and error reporting in test discovery
* Prevent faulthandler installation on armv7l
* Fix CUDA test that used negative indexing behaviour that's fixed.
* Start Flake8 checking of Numba source
Fixes:
* Fix dispatcher to only consider contiguous-ness.
* Fix 3119, raise for 0d arrays in reductions
* Reduce redundant module linking
* Fix AOT on windows.
* Fix memory management of __cuda_array_interface__ views.
* Fix typo in error name.
* Fix the default unboxing logic
* Allow non-global reference to objmode() context-manager
* Fix global reference in objmode for dynamically created function
* CUDA_ERROR_MISALIGNED_ADDRESS Using Multiple Const Arrays
* Correctly handle very old versions of colorama
* Add 32bit package guard for non-32bit installs
* Fix with-objmode warning
* Fix label offset in call inline after parfor pass
* Fixes raising of user defined exceptions for exec(<string>).
* Fix error due to function naming in CI in py2.7
* Fixed TBB's single thread execution and test added for
* Allow matching non-array objects in find_callname()
* Change getiter and iternext to not be pure.
* Make ir.UndefinedType singleton class.
* Fix np.random.shuffle sideeffect
* Raise unsupported for kwargs given to `print()`
* Remove dead script.
* Fix stencil support for boolean as return type
* Fix handling make_function literals
* Add missing unicode != unicode
* Fix complex math sqrt implementation for large -ve values
* This adds arg an check for the pattern supplied to Parfors.
* Sets list dtor linkage to `linkonce_odr` to fix visibility in AOT
Documentation Updates:
* Update 0.40 changelog with additional PRs
* Tweak spacing to avoid search box wrapping onto second line
* Add note about memory leaks with exceptions to docs.
* Add FAQ on CUDA + fork issue.
* Update docs for argsort, kind kwarg partially supported.
* Added mention of njit in 5minguide.rst
* Fix parallel reduction example in docs.
* Fix broken link and mark up problem.
* Size Numba logo in docs in em units.
* just two typos
* Document string support
* Documentation for parallel diagnostics.