summary |
shortlog | log |
commit |
commitdiff |
tree
first ⋅ prev ⋅ next
Andreas Beckmann [Mon, 29 Nov 2021 20:07:14 +0000 (21:07 +0100)]
[PATCH 101/144] link libpocl-devices-cuda.so with -lLLVM -pthread
Gbp-Pq: Name 0101-link-libpocl-devices-cuda.so-with-lLLVM-pthread.patch
Andreas Beckmann [Wed, 24 Nov 2021 14:06:54 +0000 (15:06 +0100)]
[PATCH 100/144] link libpocl-devices-pthread.so with -pthread
Gbp-Pq: Name 0100-link-libpocl-devices-pthread.so-with-pthread.patch
Andreas Beckmann [Wed, 27 Jan 2021 20:26:51 +0000 (21:26 +0100)]
[PATCH 099/144] restore setting unroll-threshold on LLVM 9
otherwise kernel/test_rotate hangs on avx512 capable cpus
all other llvm releases work fine without this setting
(this partially reverts
459d0bb5241a62af3b5b607a65dcf5e9a34a01ee)
fixes: #915
Gbp-Pq: Name 0099-restore-setting-unroll-threshold-on-LLVM-9.patch
Roman Rusyaev [Tue, 27 Apr 2021 11:28:41 +0000 (14:28 +0300)]
[PATCH 097/144] Avoid inheritance from std::vector in ParallelRegion class.
Use composition instead of inheritance.
Gbp-Pq: Name 0097-Avoid-inheritance-from-std-vector-in-ParallelRegion-.patch
nchristensen [Sun, 5 Dec 2021 16:58:18 +0000 (16:58 +0000)]
[PATCH 096/144] Avoid loss precision in event time calculations
The current code implicitly converts `epoch` to a float before calculating `time_end` and `time_start` and converting back to a `cl_ulong`. A `float` only has seven or so digits of accuracy meaning this conversion throws away the least significant digits of `epoch` if `epoch` is larger than about 1e8. Conversely, the precision of `diff` is around 1e-3 so multiplying by 1e6 makes all of these digits > 1 and converting this number to a `cl_ulong` will not lose any digits. The gist below illustrates this in Python.
https://gist.github.com/nchristensen/
15ab1fd53ba099accaab780ab6dbad90
Gbp-Pq: Name 0096-Avoid-loss-precision-in-event-time-calculations.patch
Andreas Beckmann [Tue, 7 Dec 2021 17:06:32 +0000 (18:06 +0100)]
[PATCH 90/90] do not link libpocl-devices-hsa.so against libpocl-devices-basic.so
Gbp-Pq: Name 0090-do-not-link-libpocl-devices-hsa.so-against-libpocl-d.patch
Andreas Beckmann [Mon, 6 Dec 2021 23:25:21 +0000 (00:25 +0100)]
[PATCH 89/90] rename pocl_basic_svm_fill() to pocl_driver_svm_fill()
Gbp-Pq: Name 0089-rename-pocl_basic_svm_fill-to-pocl_driver_svm_fill.patch
Andreas Beckmann [Mon, 6 Dec 2021 23:23:00 +0000 (00:23 +0100)]
[PATCH 88/90] rename pocl_basic_free() to pocl_driver_free()
Gbp-Pq: Name 0088-rename-pocl_basic_free-to-pocl_driver_free.patch
Andreas Beckmann [Mon, 6 Dec 2021 23:14:36 +0000 (00:14 +0100)]
[PATCH 87/90] rename pocl_basic_alloc_mem_obj() to pocl_driver_alloc_mem_obj()
Gbp-Pq: Name 0087-rename-pocl_basic_alloc_mem_obj-to-pocl_driver_alloc.patch
Andreas Beckmann [Mon, 6 Dec 2021 19:14:37 +0000 (20:14 +0100)]
[PATCH 86/90] enable VISIBILITY_HIDDEN by default
(unless the proxy device is enabled)
Gbp-Pq: Name 0086-enable-VISIBILITY_HIDDEN-by-default.patch
Andreas Beckmann [Sat, 4 Dec 2021 20:51:30 +0000 (21:51 +0100)]
[PATCH 84/90] rename byteswap_uint32_t() to pocl_byteswap_uint32_t()
Gbp-Pq: Name 0084-rename-byteswap_uint32_t-to-pocl_byteswap_uint32_t.patch
Andreas Beckmann [Fri, 3 Dec 2021 10:34:09 +0000 (11:34 +0100)]
[PATCH 83/90] rename init_mem_region() to pocl_init_mem_region()
Gbp-Pq: Name 0083-rename-init_mem_region-to-pocl_init_mem_region.patch
Andreas Beckmann [Fri, 3 Dec 2021 10:27:49 +0000 (11:27 +0100)]
[PATCH 82/90] rename free_chunk() to pocl_free_chunk()
Gbp-Pq: Name 0082-rename-free_chunk-to-pocl_free_chunk.patch
Andreas Beckmann [Fri, 3 Dec 2021 10:21:48 +0000 (11:21 +0100)]
[PATCH 81/90] rename alloc_buffer_from_region() to pocl_alloc_buffer_from_region()
Gbp-Pq: Name 0081-rename-alloc_buffer_from_region-to-pocl_alloc_buffer.patch
Andreas Beckmann [Fri, 3 Dec 2021 10:17:46 +0000 (11:17 +0100)]
[PATCH 80/90] rename get_llvm_cpu_name() to pocl_get_llvm_cpu_name()
Gbp-Pq: Name 0080-rename-get_llvm_cpu_name-to-pocl_get_llvm_cpu_name.patch
Andreas Beckmann [Fri, 3 Dec 2021 09:36:27 +0000 (10:36 +0100)]
[PATCH 79/90] add POCL_EXPORT to additional symbols needed by the tce device
Gbp-Pq: Name 0079-add-POCL_EXPORT-to-additional-symbols-needed-by-the-.patch
Andreas Beckmann [Mon, 29 Nov 2021 19:21:02 +0000 (20:21 +0100)]
[PATCH 78/90] add POCL_EXPORT to additional symbols needed by the hsa device
Gbp-Pq: Name 0078-add-POCL_EXPORT-to-additional-symbols-needed-by-the-.patch
Andreas Beckmann [Wed, 13 Oct 2021 14:23:53 +0000 (16:23 +0200)]
[PATCH 77/90] add POCL_EXPORT to additional symbols needed if hardening is enabled
Gbp-Pq: Name 0077-add-POCL_EXPORT-to-additional-symbols-needed-if-hard.patch
Andreas Beckmann [Wed, 10 Feb 2021 22:25:57 +0000 (23:25 +0100)]
[PATCH 76/90] add POCL_EXPORT to additional symbols needed by the cuda device
Gbp-Pq: Name 0076-add-POCL_EXPORT-to-additional-symbols-needed-by-the-.patch
Andreas Beckmann [Fri, 3 Dec 2021 18:28:10 +0000 (19:28 +0100)]
[PATCH 75/90] be more verbose on pthread errors
Gbp-Pq: Name 0075-be-more-verbose-on-pthread-errors.patch
Andreas Beckmann [Fri, 26 Nov 2021 13:22:35 +0000 (14:22 +0100)]
[PATCH 74/90] pthread: cleanup/fail pthread_scheduler_init() if any worker is out-of-memory
fixes: #1009
Gbp-Pq: Name 0074-pthread-cleanup-fail-pthread_scheduler_init-if-any-w.patch
Andreas Beckmann [Fri, 26 Nov 2021 13:11:30 +0000 (14:11 +0100)]
[PATCH 73/90] pthread: add barrier after worker thread initialization
Gbp-Pq: Name 0073-pthread-add-barrier-after-worker-thread-initializati.patch
Andreas Beckmann [Fri, 26 Nov 2021 13:02:56 +0000 (14:02 +0100)]
[PATCH 72/90] pthread: add return code to pthread_scheduler_init()
Gbp-Pq: Name 0072-pthread-add-return-code-to-pthread_scheduler_init.patch
Andreas Beckmann [Thu, 2 Dec 2021 01:34:09 +0000 (02:34 +0100)]
[PATCH 71/90] add PTHREAD_CHECK() to all macros wrapping pthread_*() calls
Gbp-Pq: Name 0071-add-PTHREAD_CHECK-to-all-macros-wrapping-pthread_-ca.patch
Andreas Beckmann [Fri, 3 Dec 2021 15:14:53 +0000 (16:14 +0100)]
[PATCH 69/90] add PTHREAD_CHECK2() accepting an additional non-zero return code
use e.g. for
* pthread_broadcast(): PTHREAD_BARRIER_SERIAL_THREAD
* pthread_cond_timedwait(): ETIMEDOUT
avoid the extra function call to the error handler if the command succeeded
Gbp-Pq: Name 0069-add-PTHREAD_CHECK2-accepting-an-additional-non-zero-.patch
Andreas Beckmann [Thu, 2 Dec 2021 00:53:31 +0000 (01:53 +0100)]
[PATCH 68/90] move PTHREAD_CHECK() to pocl_cl.h
Gbp-Pq: Name 0068-move-PTHREAD_CHECK-to-pocl_cl.h.patch
Andreas Beckmann [Wed, 1 Dec 2021 22:39:45 +0000 (23:39 +0100)]
[PATCH 66/90] pthread: add PTHREAD_CHECK() to all pthread_*() calls
Gbp-Pq: Name 0066-pthread-add-PTHREAD_CHECK-to-all-pthread_-calls.patch
Andreas Beckmann [Wed, 1 Dec 2021 22:16:14 +0000 (23:16 +0100)]
[PATCH 64/90] cuda: add PTHREAD_CHECK() to all pthread_*() calls
Gbp-Pq: Name 0064-cuda-add-PTHREAD_CHECK-to-all-pthread_-calls.patch
Andreas Beckmann [Wed, 1 Dec 2021 21:50:25 +0000 (22:50 +0100)]
[PATCH 63/90] hsa: add PTHREAD_CHECK() to the remaining pthread_*() calls
Gbp-Pq: Name 0063-hsa-add-PTHREAD_CHECK-to-the-remaining-pthread_-call.patch
Andreas Beckmann [Thu, 2 Dec 2021 15:02:44 +0000 (16:02 +0100)]
[PATCH 62/90] add test test_ucharn.cl
this is primarily intended for inspecting the disassembled kernel
to debug parameter passing issues
Gbp-Pq: Name 0062-add-test-test_ucharn.cl.patch
Andreas Beckmann [Thu, 2 Dec 2021 23:40:21 +0000 (00:40 +0100)]
[PATCH 61/90] test_printf_vectors fails on most platforms
Gbp-Pq: Name 0061-test_printf_vectors-fails-on-most-platforms.patch
Andreas Beckmann [Mon, 22 Nov 2021 12:21:28 +0000 (13:21 +0100)]
[PATCH 60/90] add printf() tests for all (unsigned, fp) vector types and sizes
Gbp-Pq: Name 0060-add-printf-tests-for-all-unsigned-fp-vector-types-an.patch
Andreas Beckmann [Thu, 2 Dec 2021 22:49:57 +0000 (23:49 +0100)]
[PATCH 59/90] move vector type printf tests to a separate test
Gbp-Pq: Name 0059-move-vector-type-printf-tests-to-a-separate-test.patch
Andreas Beckmann [Fri, 19 Nov 2021 11:10:45 +0000 (12:10 +0100)]
[PATCH 58/90] run_test: show diff if test output does not match the expected output
Gbp-Pq: Name 0058-run_test-show-diff-if-test-output-does-not-match-the.patch
Michal Babej [Wed, 1 Dec 2021 13:16:26 +0000 (15:16 +0200)]
[PATCH 52/90] vulkan: label tests that work with vulkan driver
Gbp-Pq: Name 0052-vulkan-label-tests-that-work-with-vulkan-driver.patch
Michal Babej [Thu, 25 Nov 2021 13:49:47 +0000 (15:49 +0200)]
[PATCH 43/90] MapBuffer/MapImage: allocate the cl_mem backing buffer if required
Gbp-Pq: Name 0043-MapBuffer-MapImage-allocate-the-cl_mem-backing-buffe.patch
Giuseppe Bilotta [Tue, 23 Nov 2021 16:25:10 +0000 (17:25 +0100)]
[PATCH 41/90] CUDA: const correctness in get/free device name string
This silences a couple of const correctness warnings due to implicit
casts to/from char*/const char*.
Gbp-Pq: Name 0041-CUDA-const-correctness-in-get-free-device-name-strin.patch
Giuseppe Bilotta [Wed, 3 Feb 2021 19:32:31 +0000 (20:32 +0100)]
[PATCH 35/90] CUDA: don't leak the epoch_event
Gbp-Pq: Name 0035-CUDA-don-t-leak-the-epoch_event.patch
André Gemünd [Thu, 11 Nov 2021 08:38:24 +0000 (09:38 +0100)]
[PATCH 30/90] Add missing cl_ext_pocl.h to install.
Gbp-Pq: Name 0030-Add-missing-cl_ext_pocl.h-to-install.patch
Michal Babej [Mon, 20 Sep 2021 06:12:24 +0000 (09:12 +0300)]
[PATCH 18/90] Fix tests/regression/test_alignment_with_dynamic_wg.cpp
Don't rebuild the program inside loop iteration,
instead create a new one on each iteration.
Gbp-Pq: Name 0018-Fix-tests-regression-test_alignment_with_dynamic_wg..patch
Michal Babej [Thu, 9 Sep 2021 08:39:05 +0000 (11:39 +0300)]
[PATCH 17/90] Fix missing define in OpenCL-CTS
Gbp-Pq: Name 0017-Fix-missing-define-in-OpenCL-CTS.patch
Michal Babej [Wed, 8 Sep 2021 15:20:08 +0000 (18:20 +0300)]
[PATCH 16/90] Move pocl_cmd_max_grid_dim_width() outside HAVE_DLFCN_H
some build configs that don't use dlopen() require it
Gbp-Pq: Name 0016-Move-pocl_cmd_max_grid_dim_width-outside-HAVE_DLFCN_.patch
Michal Babej [Wed, 8 Sep 2021 15:17:03 +0000 (18:17 +0300)]
[PATCH 15/90] Enable test_dlopen only if DLFCN is available
Gbp-Pq: Name 0015-Enable-test_dlopen-only-if-DLFCN-is-available.patch
Michal Babej [Mon, 6 Sep 2021 04:28:26 +0000 (07:28 +0300)]
[PATCH 14/90] Fix uninitialized variable in tests/regression/test_alignment_with_dynamic_wg2.cpp
Gbp-Pq: Name 0014-Fix-uninitialized-variable-in-tests-regression-test_.patch
Michal Babej [Tue, 31 Aug 2021 06:11:52 +0000 (09:11 +0300)]
[PATCH 13/90] Fix data race in compile_and_link_program()
lock cl_program before touching kernels
Gbp-Pq: Name 0013-Fix-data-race-in-compile_and_link_program.patch
Michal Babej [Tue, 20 Jul 2021 09:19:28 +0000 (12:19 +0300)]
[PATCH 11/90] Random fix for accel driver
Accel does not use CPU or LLVM -> remove call to get_llvm_cpu_name()
Gbp-Pq: Name 0011-Random-fix-for-accel-driver.patch
Pekka Jääskeläinen [Tue, 12 Oct 2021 15:22:28 +0000 (18:22 +0300)]
[PATCH 04/90] Tweaks
Gbp-Pq: Name 0004-Tweaks.patch
Andreas Beckmann [Fri, 7 Jan 2022 23:55:22 +0000 (23:55 +0000)]
pocl (1.8-3) unstable; urgency=medium
* Cherry-pick several upstream(ed) patches:
- Check return code of all pthread_*() function calls for errors.
- Return error (don't assert) if any worker thread is out-of-memory.
- Show more information on OpenCL compile errors or test failures.
- Export more symbols needed by other devices.
- Fix underlinked device libraries.
- Skip tests that require unavailable devices.
- Unify test success/failure reporting.
- Miscellaneous bugfixes.
* armel/armhf: Reduce printf buffer size to 4 MB per core.
(Closes: #997908)
* Add superficial autopkgtest running clinfo.
* Update symbols file.
* Upload to unstable.
[dgit import unpatched pocl 1.8-3]
Andreas Beckmann [Fri, 7 Jan 2022 23:55:22 +0000 (23:55 +0000)]
Import pocl_1.8-3.debian.tar.xz
[dgit import tarball pocl 1.8-3 pocl_1.8-3.debian.tar.xz]
Andreas Beckmann [Fri, 15 Oct 2021 16:22:36 +0000 (17:22 +0100)]
Import pocl_1.8.orig.tar.gz
[dgit import orig pocl_1.8.orig.tar.gz]