Andreas Beckmann [Tue, 14 Jun 2022 12:14:47 +0000 (13:14 +0100)]
Merge pocl (3.0-2) import into refs/heads/workingbranch
Andreas Beckmann [Tue, 14 Jun 2022 12:14:47 +0000 (13:14 +0100)]
fix opencl.hpp detection
Gbp-Pq: Name fix-opencl.hpp-detection.patch
Andreas Beckmann [Tue, 14 Jun 2022 12:14:47 +0000 (13:14 +0100)]
this is not a release candidate
Forwarded: no-needed
Gbp-Pq: Name no-rc.patch
Andreas Beckmann [Tue, 14 Jun 2022 12:14:47 +0000 (13:14 +0100)]
kill hanging tests after 30 minutes
Gbp-Pq: Name timeout.patch
Andreas Beckmann [Tue, 14 Jun 2022 12:14:47 +0000 (13:14 +0100)]
tune distro mode
Forwarded: no-needed
Gbp-Pq: Name distro.patch
Andreas Beckmann [Tue, 14 Jun 2022 12:14:47 +0000 (13:14 +0100)]
add support for a GENERIC cpu type that uses llvm defaults
Forwarded: not-needed
Gbp-Pq: Name generic-cpu.patch
Andreas Beckmann [Tue, 14 Jun 2022 12:14:47 +0000 (13:14 +0100)]
HACK! regression/struct_kernel_arguments fails on i686
141/143 Test #85: regression/struct_kernel_arguments ..................................................***Failed Error regular expression found in output. Regex=[FAIL] 4.58 sec
CMake Error at /build/pocl-1.4/cmake/run_test.cmake:34 (message):
FAIL: Test exited with nonzero code (1):
/build/pocl-1.4/obj-i686-linux-gnu/tests/regression/test_structs_as_args
STDOUT:
F(4: 0 != 5) F(5: -
2147483648 != 6) F(6: 0 != 7) F(7: 0 != 8)
STDERR:
-- OK
Bug: https://github.com/pocl/pocl/issues/801
Bug: https://github.com/KhronosGroup/OpenCL-Headers/issues/149
Forwarded: not-needed
Gbp-Pq: Name test_structs_as_args.patch
Andreas Beckmann [Tue, 7 Apr 2020 23:33:12 +0000 (01:33 +0200)]
[PATCH 02/10] link against single shared libclang-cpp when building with llvm 9, too
Gbp-Pq: Name 2002-link-against-single-shared-libclang-cpp-when-buildin.patch
Andreas Beckmann [Tue, 15 Mar 2022 14:59:08 +0000 (15:59 +0100)]
[PATCH 1017/1017] support overriding the runtime cpu detection with POCL_LLVM_CPU_NAME
Gbp-Pq: Name 1017-support-overriding-the-runtime-cpu-detection-with-PO.patch
Andreas Beckmann [Tue, 15 Mar 2022 10:39:30 +0000 (11:39 +0100)]
[PATCH 1016/1017] let LLC_HOST_CPU be forced (HOST_CPU_FORCED) to any cpu type
do not special case LLC_HOST_CPU == LLC_HOST_CPU_AUTO
Gbp-Pq: Name 1016-let-LLC_HOST_CPU-be-forced-HOST_CPU_FORCED-to-any-cp.patch
Andreas Beckmann [Tue, 15 Mar 2022 10:05:45 +0000 (11:05 +0100)]
[PATCH 1015/1017] use _LLC_HOST_CPU as internal variable
do not set LLC_HOST_CPU to LLC_HOST_CPU_AUTO as it will turn on
HOST_CPU_FORCED on a subsequent cmake run
Gbp-Pq: Name 1015-use-_LLC_HOST_CPU-as-internal-variable.patch
Andreas Beckmann [Tue, 15 Mar 2022 09:33:22 +0000 (10:33 +0100)]
[PATCH 1014/1017] improve grouping of ENABLE_HOST_CPU_DEVICES specific tests
Gbp-Pq: Name 1014-improve-grouping-of-ENABLE_HOST_CPU_DEVICES-specific.patch
Andreas Beckmann [Sat, 9 Apr 2022 14:30:15 +0000 (16:30 +0200)]
[PATCH 1013/1017] allow overriding kernellib_name for distro builds with POCL_KERNELLIB_NAME
the kernellib_name (and corresponding llvm_cpu) can be overridden
to a target with a subset of the features supported by the current cpu
this will allow running e.g. code generation and tests for sse2 (but no
newer features) on a modern cpu
Gbp-Pq: Name 1013-allow-overriding-kernellib_name-for-distro-builds-wi.patch
Andreas Beckmann [Sat, 9 Apr 2022 13:28:42 +0000 (15:28 +0200)]
[PATCH 1012/1017] _cl_device_id: add const char* kernellib_name
determine the value at the same time as llvm_cpu, if possible
Gbp-Pq: Name 1012-_cl_device_id-add-const-char-kernellib_name.patch
Andreas Beckmann [Thu, 7 Apr 2022 10:04:05 +0000 (12:04 +0200)]
[PATCH 1011/1017] add pocl_get_distro_cpu_name()
in distro builds, map each kernellib_name to a specific llvm_cpu
which is then used as compilation target instead of the native cpu
this limits the possible code generation targets in distro builds
in order to be able to run tests against all of them with a minimal
number of different (physical) CPU models
Gbp-Pq: Name 1011-add-pocl_get_distro_cpu_name.patch
Andreas Beckmann [Wed, 23 Mar 2022 13:24:18 +0000 (14:24 +0100)]
[PATCH 1010/1017] use a more generic solution for pocl_get_distro_kernellib_name()
Gbp-Pq: Name 1010-use-a-more-generic-solution-for-pocl_get_distro_kern.patch
Andreas Beckmann [Wed, 23 Mar 2022 12:55:50 +0000 (13:55 +0100)]
[PATCH 1009/1017] rename getX86KernelLibName() to pocl_get_distro_kernellib_name()
Gbp-Pq: Name 1009-rename-getX86KernelLibName-to-pocl_get_distro_kernel.patch
Andreas Beckmann [Thu, 7 Apr 2022 20:27:03 +0000 (22:27 +0200)]
[PATCH 1008/1017] use a UTC timestamp for POCL_BUILD_TIMESTAMP
this will be reproducible if SOURCE_DATE_EPOCH is set,
regardless of any TZ setting
Gbp-Pq: Name 1008-use-a-UTC-timestamp-for-POCL_BUILD_TIMESTAMP.patch
Andreas Beckmann [Tue, 22 Mar 2022 13:51:19 +0000 (14:51 +0100)]
[PATCH 1007/1017] force kernellib name on HOST_CPU_FORCED, too
Gbp-Pq: Name 1007-force-kernellib-name-on-HOST_CPU_FORCED-too.patch
Andreas Beckmann [Wed, 16 Mar 2022 15:10:50 +0000 (16:10 +0100)]
[PATCH 1006/1017] always use runtime determined llvm_cpu to compute build_hash
Gbp-Pq: Name 1006-always-use-runtime-determined-llvm_cpu-to-compute-bu.patch
Andreas Beckmann [Tue, 15 Mar 2022 14:43:45 +0000 (15:43 +0100)]
[PATCH 1005/1017] no 'using namespace' in headers
Gbp-Pq: Name 1005-no-using-namespace-in-headers.patch
Andreas Beckmann [Tue, 15 Mar 2022 14:47:49 +0000 (15:47 +0100)]
[PATCH 1004/1017] try to apply the LLVM style first
otherwise lib/llvmopencl/.*\.h get reformatted with GNU style
Gbp-Pq: Name 1004-try-to-apply-the-LLVM-style-first.patch
Andreas Beckmann [Tue, 15 Mar 2022 09:24:11 +0000 (10:24 +0100)]
[PATCH 1003/1017] llc --version no longer exits with a non-zero return code
Gbp-Pq: Name 1003-llc-version-no-longer-exits-with-a-non-zero-return-c.patch
Andreas Beckmann [Tue, 15 Mar 2022 07:49:28 +0000 (08:49 +0100)]
[PATCH 1002/1017] check CMAKE_LIBRARY_ARCHITECTURE instead of CMAKE_SYSTEM_PROCESSOR
e.g. architecture armhf runs on an aarch64 processor, too
Gbp-Pq: Name 1002-check-CMAKE_LIBRARY_ARCHITECTURE-instead-of-CMAKE_SY.patch
Andreas Beckmann [Mon, 21 Mar 2022 16:48:30 +0000 (17:48 +0100)]
[PATCH 1001/1017] poclcc: use poclu_show_program_build_log()
Gbp-Pq: Name 1001-poclcc-use-poclu_show_program_build_log.patch
Andreas Beckmann [Tue, 14 Jun 2022 12:14:47 +0000 (13:14 +0100)]
pocl (3.0-2) unstable; urgency=medium
* Update symbols file.
* Add some patches to allow overriding CPU detection.
* Test all kernellib variants (that are supported by the host cpu) on distro
builds.
[dgit import unpatched pocl 3.0-2]
Andreas Beckmann [Tue, 14 Jun 2022 12:14:47 +0000 (13:14 +0100)]
Import pocl_3.0-2.debian.tar.xz
[dgit import tarball pocl 3.0-2 pocl_3.0-2.debian.tar.xz]
Andreas Beckmann [Mon, 13 Jun 2022 06:57:49 +0000 (07:57 +0100)]
Import pocl_3.0.orig.tar.gz
[dgit import orig pocl_3.0.orig.tar.gz]
Andreas Beckmann [Fri, 7 Jan 2022 23:55:22 +0000 (23:55 +0000)]
Merge pocl (1.8-3) import into refs/heads/workingbranch
Debian OpenCL Maintainers [Fri, 7 Jan 2022 23:55:22 +0000 (23:55 +0000)]
fix-accel
Gbp-Pq: Name fix-accel.patch
Andreas Beckmann [Fri, 7 Jan 2022 23:55:22 +0000 (23:55 +0000)]
the pocl cl extensions are not yet in the upstream cl headers
Forwarded: no-needed
Gbp-Pq: Name cl_ext_pocl.patch
Andreas Beckmann [Fri, 7 Jan 2022 23:55:22 +0000 (23:55 +0000)]
we move the embedded copy aside to use system CL headers
Forwarded: no-needed
Gbp-Pq: Name use-system-CL-headers.patch
Andreas Beckmann [Fri, 7 Jan 2022 23:55:22 +0000 (23:55 +0000)]
kill hanging tests after 30 minutes
Gbp-Pq: Name timeout.patch
Andreas Beckmann [Fri, 7 Jan 2022 23:55:22 +0000 (23:55 +0000)]
dump some cmake information
Forwarded: not-needed
Gbp-Pq: Name verbose.patch
Andreas Beckmann [Fri, 7 Jan 2022 23:55:22 +0000 (23:55 +0000)]
tune distro mode
Forwarded: no-needed
Gbp-Pq: Name distro.patch
Andreas Beckmann [Fri, 7 Jan 2022 23:55:22 +0000 (23:55 +0000)]
add support for a GENERIC cpu type that uses llvm defaults
Forwarded: not-needed
Gbp-Pq: Name generic-cpu.patch
Andreas Beckmann [Fri, 7 Jan 2022 23:55:22 +0000 (23:55 +0000)]
HACK! regression/struct_kernel_arguments fails on i686
141/143 Test #85: regression/struct_kernel_arguments ..................................................***Failed Error regular expression found in output. Regex=[FAIL] 4.58 sec
CMake Error at /build/pocl-1.4/cmake/run_test.cmake:34 (message):
FAIL: Test exited with nonzero code (1):
/build/pocl-1.4/obj-i686-linux-gnu/tests/regression/test_structs_as_args
STDOUT:
F(4: 0 != 5) F(5: -
2147483648 != 6) F(6: 0 != 7) F(7: 0 != 8)
STDERR:
-- OK
Bug: https://github.com/pocl/pocl/issues/801
Bug: https://github.com/KhronosGroup/OpenCL-Headers/issues/149
Forwarded: not-needed
Gbp-Pq: Name test_structs_as_args.patch
Andreas Beckmann [Tue, 7 Apr 2020 23:33:12 +0000 (01:33 +0200)]
[PATCH 02/10] link against single shared libclang-cpp when building with llvm 9, too
Gbp-Pq: Name 2002-link-against-single-shared-libclang-cpp-when-buildin.patch
Pekka Jääskeläinen [Thu, 6 Jan 2022 14:02:55 +0000 (16:02 +0200)]
[PATCH 144/144] test_dlopen fix for ICD enabled
Gbp-Pq: Name 0144-test_dlopen-fix-for-ICD-enabled.patch
Andreas Beckmann [Mon, 6 Dec 2021 17:49:48 +0000 (18:49 +0100)]
[PATCH 143/144] add printf and parameter passing test for ulongn vector types only
another test primarily for inspecting the disassembled kernel
as a debugging aid for the vector type printf issues
Gbp-Pq: Name 0143-add-printf-and-parameter-passing-test-for-ulongn-vec.patch
Andreas Beckmann [Tue, 7 Dec 2021 16:33:23 +0000 (17:33 +0100)]
[PATCH 142/144] ignore known llvm segfault test failures
mark as WILL_FAIL where reproducible
mark as DISABLED where flaky
Gbp-Pq: Name 0142-ignore-known-llvm-segfault-test-failures.patch
Andreas Beckmann [Wed, 24 Nov 2021 21:39:43 +0000 (22:39 +0100)]
[PATCH 141/144] add testcase for llvm segfault (issue #889)
The segmentation fault can be observed with llvm-10, llvm-11 and llvm-12
and seems to be fixed in llvm-13. It happens on the architectures armhf
and armel (both 32-bit) always and on x86_64 sporadically.
The test segfaults only on the first run (i.e. the kernel is not yet in
pocl's kernel cache) while it passes on subsequent execution (with
something already in the kernel cache), emitting only some llvm
diagnostics:
inlinable function call in a function with debug info must have a !dbg location
%11 = call i32 @_Z12get_local_idj(i32 0)
inlinable function call in a function with debug info must have a !dbg location
%19 = call i32 @_Z12get_local_idj(i32 1)
inlinable function call in a function with debug info must have a !dbg location
%27 = call i32 @_Z12get_local_idj(i32 2)
The backtrace of the segmentation fault as observed with llvm-10 and pocl 1.6:
#0 getEmissionKind () at .../llvm/include/llvm/IR/DebugInfoMetadata.h:1244
#1 initialize () at .../llvm/lib/CodeGen/LexicalScopes.cpp:53
#2 0xb14102f0 in computeIntervals () at .../llvm/lib/CodeGen/LiveDebugVariables.cpp:979
#3 runOnMachineFunction () at .../llvm/lib/CodeGen/LiveDebugVariables.cpp:996
#4 runOnMachineFunction () at .../llvm/lib/CodeGen/LiveDebugVariables.cpp:1023
#5 0xb14856c8 in runOnFunction () at .../llvm/lib/CodeGen/MachineFunctionPass.cpp:73
#6 0xb12ff494 in runOnFunction () at .../llvm/lib/IR/LegacyPassManager.cpp:1481
#7 0xb12ff750 in runOnModule () at .../llvm/lib/IR/LegacyPassManager.cpp:1517
#8 0xb12ffba8 in runOnModule () at .../llvm/lib/IR/LegacyPassManager.cpp:1582
#9 run () at .../llvm/lib/IR/LegacyPassManager.cpp:1694
#10 0xb6e64c82 in pocl_llvm_codegen (Device=Device@entry=0xdb0010, Modp=0x1361838, Output=Output@entry=0xbefde86c, OutputSize=OutputSize@entry=0xbefde880) at ./lib/CL/pocl_llvm_wg.cc:624
#11 0xb6e291de in llvm_codegen (output=output@entry=0xdeb898 "...BMDHA/Sdot_kernel/0-0-0/Sdot_kernel.so", device_i=device_i@entry=0, kernel=kernel@entry=0xbefe0240,
device=0xdb0010, command=command@entry=0xbefe0278, specialize=specialize@entry=0) at ./lib/CL/devices/common.c:158
#12 0xb6e2ae44 in pocl_check_kernel_disk_cache (command=command@entry=0xbefe0278, specialized=specialized@entry=0) at ./lib/CL/devices/common.c:958
#13 0xb6e2b262 in pocl_check_kernel_dlhandle_cache (command=0xbefe0278, initial_refcount=0, specialize=0) at ./lib/CL/devices/common.c:1081
#14 0xb6e033d4 in program_compile_dynamic_wg_binaries (program=program@entry=0xd8ab88) at ./lib/CL/pocl_build.c:179
#15 0xb6e13f20 in get_binary_sizes (sizes=0xbefe0384, program=0xd8ab88) at ./lib/CL/clGetProgramInfo.c:36
#16 POclGetProgramInfo (program=0xd8ab88, param_name=4453, param_value_size=128, param_value=0xbefe0384, param_value_size_ret=0xbefe0380) at ./lib/CL/clGetProgramInfo.c:115
#17 0x00473070 in main () at 975931.c:238
https://github.com/pocl/pocl/issues/889
https://bugs.debian.org/975931
Gbp-Pq: Name 0141-add-testcase-for-llvm-segfault-issue-889.patch
Andreas Beckmann [Mon, 6 Dec 2021 19:09:29 +0000 (20:09 +0100)]
[PATCH 140/144] add dlopen test for the proxy device
Gbp-Pq: Name 0140-add-dlopen-test-for-the-proxy-device.patch
Andreas Beckmann [Mon, 6 Dec 2021 17:03:05 +0000 (18:03 +0100)]
[PATCH 139/144] do not dlopen(libpocl.so) without ENABLE_ICD
Gbp-Pq: Name 0139-do-not-dlopen-libpocl.so-without-ENABLE_ICD.patch
Andreas Beckmann [Tue, 30 Nov 2021 22:16:11 +0000 (23:16 +0100)]
[PATCH 138/144] tests: (run_)kernel.c: show build log after clBuildProgram() failed
Gbp-Pq: Name 0138-tests-run_-kernel.c-show-build-log-after-clBuildProg.patch
Andreas Beckmann [Tue, 30 Nov 2021 22:04:21 +0000 (23:04 +0100)]
[PATCH 137/144] add poclu_show_program_build_log(cl_program)
Gbp-Pq: Name 0137-add-poclu_show_program_build_log-cl_program.patch
Andreas Beckmann [Wed, 24 Nov 2021 09:46:50 +0000 (10:46 +0100)]
[PATCH 136/144] skip subdevice tests on devices with only 1 compute unit
do not let them fail
Gbp-Pq: Name 0136-skip-subdevice-tests-on-devices-with-only-1-compute-.patch
Andreas Beckmann [Wed, 24 Nov 2021 09:29:45 +0000 (10:29 +0100)]
[PATCH 135/144] skip tests that require unavailable devices
do not silently let them "pass"
77 is the return code used by GNU autotest to indicate SKIP
Gbp-Pq: Name 0135-skip-tests-that-require-unavailable-devices.patch
Andreas Beckmann [Thu, 25 Nov 2021 00:30:54 +0000 (01:30 +0100)]
[PATCH 134/144] fix unlabeled tests
Gbp-Pq: Name 0134-fix-unlabeled-tests.patch
Andreas Beckmann [Wed, 24 Nov 2021 23:35:49 +0000 (00:35 +0100)]
[PATCH 133/144] consistently use add_test(NAME ... COMMAND ...)
Gbp-Pq: Name 0133-consistently-use-add_test-NAME-.-COMMAND.patch
Andreas Beckmann [Tue, 23 Nov 2021 17:21:16 +0000 (18:21 +0100)]
[PATCH 132/144] tests: emit "OK" along EXIT_SUCCESS
Gbp-Pq: Name 0132-tests-emit-OK-along-EXIT_SUCCESS.patch
Andreas Beckmann [Tue, 23 Nov 2021 17:17:11 +0000 (18:17 +0100)]
[PATCH 131/144] tests: simplify EXIT_SUCCESS/EXIT_FAILURE usage
Gbp-Pq: Name 0131-tests-simplify-EXIT_SUCCESS-EXIT_FAILURE-usage.patch
Andreas Beckmann [Tue, 23 Nov 2021 17:03:10 +0000 (18:03 +0100)]
[PATCH 130/144] tests: use EXIT_SUCCESS/EXIT_FAILURE in most tests
Gbp-Pq: Name 0130-tests-use-EXIT_SUCCESS-EXIT_FAILURE-in-most-tests.patch
Andreas Beckmann [Thu, 16 Dec 2021 15:06:14 +0000 (16:06 +0100)]
[PATCH 128/144] add POCL_EXPORT to additional symbols needed by the vulkan device
Gbp-Pq: Name 0128-add-POCL_EXPORT-to-additional-symbols-needed-by-the-.patch
Andreas Beckmann [Thu, 9 Dec 2021 16:20:31 +0000 (17:20 +0100)]
[PATCH 127/144] really enable VISIBILITY_HIDDEN by default
Gbp-Pq: Name 0127-really-enable-VISIBILITY_HIDDEN-by-default.patch
Tom Rix [Sat, 1 Jan 2022 15:30:55 +0000 (10:30 -0500)]
[PATCH 122/144] llvm 13: handle deprecated CreateLoad
As of llvm 13, CreateLoad prefers to have a type arguement as it's
first parameter, deprecating the old api.
For readablity, seperated out the CreateConstGEP* calls which have
a similar deprecation.
Signed-off-by: Tom Rix <trix@redhat.com>
Gbp-Pq: Name 0122-llvm-13-handle-deprecated-CreateLoad.patch
Andreas Beckmann [Wed, 8 Dec 2021 23:45:45 +0000 (00:45 +0100)]
[PATCH 118/144] cmake policy CMP0025 was introduced in CMake 3.0
Gbp-Pq: Name 0118-cmake-policy-CMP0025-was-introduced-in-CMake-3.0.patch
Andreas Beckmann [Wed, 8 Dec 2021 23:35:56 +0000 (00:35 +0100)]
[PATCH 117/144] drop the "clangxx works" check from the vecmathlib age
Gbp-Pq: Name 0117-drop-the-clangxx-works-check-from-the-vecmathlib-age.patch
Andreas Beckmann [Wed, 8 Dec 2021 23:17:03 +0000 (00:17 +0100)]
[PATCH 116/144] add a copy of the ROCm-Device-Libs license (NCSA) for ocml
Gbp-Pq: Name 0116-add-a-copy-of-the-ROCm-Device-Libs-license-NCSA-for-.patch
Andreas Beckmann [Wed, 8 Dec 2021 23:11:32 +0000 (00:11 +0100)]
[PATCH 115/144] add a copy of the Boost license for sleef
Gbp-Pq: Name 0115-add-a-copy-of-the-Boost-license-for-sleef.patch
Andreas Beckmann [Wed, 8 Dec 2021 22:32:53 +0000 (23:32 +0100)]
[PATCH 114/144] do not build libllvmopencl.so by default, no longer used since 0.14
Gbp-Pq: Name 0114-do-not-build-libllvmopencl.so-by-default-no-longer-u.patch
Andreas Beckmann [Wed, 8 Dec 2021 09:56:57 +0000 (10:56 +0100)]
[PATCH 113/144] drop irrelevant metadata causing warnings on non-x86
fixes: #1006
Gbp-Pq: Name 0113-drop-irrelevant-metadata-causing-warnings-on-non-x86.patch
Andreas Beckmann [Fri, 3 Dec 2021 00:55:51 +0000 (01:55 +0100)]
[PATCH 111/144] import clang-format-diff.py from clang-13
Gbp-Pq: Name 0111-import-clang-format-diff.py-from-clang-13.patch
Andreas Beckmann [Wed, 1 Dec 2021 20:42:38 +0000 (21:42 +0100)]
[PATCH 110/144] exit early if reformatting produced no changes
Gbp-Pq: Name 0110-exit-early-if-reformatting-produced-no-changes.patch
Andreas Beckmann [Wed, 1 Dec 2021 18:48:36 +0000 (19:48 +0100)]
[PATCH 109/144] use proper tempfiles
Gbp-Pq: Name 0109-use-proper-tempfiles.patch
Andreas Beckmann [Wed, 1 Dec 2021 18:43:40 +0000 (19:43 +0100)]
[PATCH 108/144] do not reformat if there are uncommitted changes
Gbp-Pq: Name 0108-do-not-reformat-if-there-are-uncommitted-changes.patch
Andreas Beckmann [Wed, 1 Dec 2021 18:39:05 +0000 (19:39 +0100)]
[PATCH 107/144] check for .git/rebase-merge to detect ongoing git rebase
.git/ORIG_HEAD may be a stale file
Gbp-Pq: Name 0107-check-for-.git-rebase-merge-to-detect-ongoing-git-re.patch
Andreas Beckmann [Tue, 30 Nov 2021 13:15:21 +0000 (14:15 +0100)]
[PATCH 106/144] hsa: check for clang -target amdgcn--amdhsa -nogpulib
Gbp-Pq: Name 0106-hsa-check-for-clang-target-amdgcn-amdhsa-nogpulib.patch
Andreas Beckmann [Mon, 29 Nov 2021 21:25:10 +0000 (22:25 +0100)]
[PATCH 105/144] hsa: check for clang -target hsail64
Gbp-Pq: Name 0105-hsa-check-for-clang-target-hsail64.patch
Andreas Beckmann [Fri, 3 Dec 2021 09:44:09 +0000 (10:44 +0100)]
[PATCH 103/144] link libpocl-devices-ttasim.so with -pthread
Gbp-Pq: Name 0103-link-libpocl-devices-ttasim.so-with-pthread.patch
Andreas Beckmann [Mon, 29 Nov 2021 20:08:29 +0000 (21:08 +0100)]
[PATCH 102/144] link libpocl-devices-hsa.so with -pthread
Gbp-Pq: Name 0102-link-libpocl-devices-hsa.so-with-pthread.patch
Andreas Beckmann [Mon, 29 Nov 2021 20:07:14 +0000 (21:07 +0100)]
[PATCH 101/144] link libpocl-devices-cuda.so with -lLLVM -pthread
Gbp-Pq: Name 0101-link-libpocl-devices-cuda.so-with-lLLVM-pthread.patch
Andreas Beckmann [Wed, 24 Nov 2021 14:06:54 +0000 (15:06 +0100)]
[PATCH 100/144] link libpocl-devices-pthread.so with -pthread
Gbp-Pq: Name 0100-link-libpocl-devices-pthread.so-with-pthread.patch
Andreas Beckmann [Wed, 27 Jan 2021 20:26:51 +0000 (21:26 +0100)]
[PATCH 099/144] restore setting unroll-threshold on LLVM 9
otherwise kernel/test_rotate hangs on avx512 capable cpus
all other llvm releases work fine without this setting
(this partially reverts
459d0bb5241a62af3b5b607a65dcf5e9a34a01ee)
fixes: #915
Gbp-Pq: Name 0099-restore-setting-unroll-threshold-on-LLVM-9.patch
Roman Rusyaev [Tue, 27 Apr 2021 11:28:41 +0000 (14:28 +0300)]
[PATCH 097/144] Avoid inheritance from std::vector in ParallelRegion class.
Use composition instead of inheritance.
Gbp-Pq: Name 0097-Avoid-inheritance-from-std-vector-in-ParallelRegion-.patch
nchristensen [Sun, 5 Dec 2021 16:58:18 +0000 (16:58 +0000)]
[PATCH 096/144] Avoid loss precision in event time calculations
The current code implicitly converts `epoch` to a float before calculating `time_end` and `time_start` and converting back to a `cl_ulong`. A `float` only has seven or so digits of accuracy meaning this conversion throws away the least significant digits of `epoch` if `epoch` is larger than about 1e8. Conversely, the precision of `diff` is around 1e-3 so multiplying by 1e6 makes all of these digits > 1 and converting this number to a `cl_ulong` will not lose any digits. The gist below illustrates this in Python.
https://gist.github.com/nchristensen/
15ab1fd53ba099accaab780ab6dbad90
Gbp-Pq: Name 0096-Avoid-loss-precision-in-event-time-calculations.patch
Andreas Beckmann [Tue, 7 Dec 2021 17:06:32 +0000 (18:06 +0100)]
[PATCH 90/90] do not link libpocl-devices-hsa.so against libpocl-devices-basic.so
Gbp-Pq: Name 0090-do-not-link-libpocl-devices-hsa.so-against-libpocl-d.patch
Andreas Beckmann [Mon, 6 Dec 2021 23:25:21 +0000 (00:25 +0100)]
[PATCH 89/90] rename pocl_basic_svm_fill() to pocl_driver_svm_fill()
Gbp-Pq: Name 0089-rename-pocl_basic_svm_fill-to-pocl_driver_svm_fill.patch
Andreas Beckmann [Mon, 6 Dec 2021 23:23:00 +0000 (00:23 +0100)]
[PATCH 88/90] rename pocl_basic_free() to pocl_driver_free()
Gbp-Pq: Name 0088-rename-pocl_basic_free-to-pocl_driver_free.patch
Andreas Beckmann [Mon, 6 Dec 2021 23:14:36 +0000 (00:14 +0100)]
[PATCH 87/90] rename pocl_basic_alloc_mem_obj() to pocl_driver_alloc_mem_obj()
Gbp-Pq: Name 0087-rename-pocl_basic_alloc_mem_obj-to-pocl_driver_alloc.patch
Andreas Beckmann [Mon, 6 Dec 2021 19:14:37 +0000 (20:14 +0100)]
[PATCH 86/90] enable VISIBILITY_HIDDEN by default
(unless the proxy device is enabled)
Gbp-Pq: Name 0086-enable-VISIBILITY_HIDDEN-by-default.patch
Andreas Beckmann [Sat, 4 Dec 2021 20:51:30 +0000 (21:51 +0100)]
[PATCH 84/90] rename byteswap_uint32_t() to pocl_byteswap_uint32_t()
Gbp-Pq: Name 0084-rename-byteswap_uint32_t-to-pocl_byteswap_uint32_t.patch
Andreas Beckmann [Fri, 3 Dec 2021 10:34:09 +0000 (11:34 +0100)]
[PATCH 83/90] rename init_mem_region() to pocl_init_mem_region()
Gbp-Pq: Name 0083-rename-init_mem_region-to-pocl_init_mem_region.patch
Andreas Beckmann [Fri, 3 Dec 2021 10:27:49 +0000 (11:27 +0100)]
[PATCH 82/90] rename free_chunk() to pocl_free_chunk()
Gbp-Pq: Name 0082-rename-free_chunk-to-pocl_free_chunk.patch
Andreas Beckmann [Fri, 3 Dec 2021 10:21:48 +0000 (11:21 +0100)]
[PATCH 81/90] rename alloc_buffer_from_region() to pocl_alloc_buffer_from_region()
Gbp-Pq: Name 0081-rename-alloc_buffer_from_region-to-pocl_alloc_buffer.patch
Andreas Beckmann [Fri, 3 Dec 2021 10:17:46 +0000 (11:17 +0100)]
[PATCH 80/90] rename get_llvm_cpu_name() to pocl_get_llvm_cpu_name()
Gbp-Pq: Name 0080-rename-get_llvm_cpu_name-to-pocl_get_llvm_cpu_name.patch
Andreas Beckmann [Fri, 3 Dec 2021 09:36:27 +0000 (10:36 +0100)]
[PATCH 79/90] add POCL_EXPORT to additional symbols needed by the tce device
Gbp-Pq: Name 0079-add-POCL_EXPORT-to-additional-symbols-needed-by-the-.patch
Andreas Beckmann [Mon, 29 Nov 2021 19:21:02 +0000 (20:21 +0100)]
[PATCH 78/90] add POCL_EXPORT to additional symbols needed by the hsa device
Gbp-Pq: Name 0078-add-POCL_EXPORT-to-additional-symbols-needed-by-the-.patch
Andreas Beckmann [Wed, 13 Oct 2021 14:23:53 +0000 (16:23 +0200)]
[PATCH 77/90] add POCL_EXPORT to additional symbols needed if hardening is enabled
Gbp-Pq: Name 0077-add-POCL_EXPORT-to-additional-symbols-needed-if-hard.patch
Andreas Beckmann [Wed, 10 Feb 2021 22:25:57 +0000 (23:25 +0100)]
[PATCH 76/90] add POCL_EXPORT to additional symbols needed by the cuda device
Gbp-Pq: Name 0076-add-POCL_EXPORT-to-additional-symbols-needed-by-the-.patch
Andreas Beckmann [Fri, 3 Dec 2021 18:28:10 +0000 (19:28 +0100)]
[PATCH 75/90] be more verbose on pthread errors
Gbp-Pq: Name 0075-be-more-verbose-on-pthread-errors.patch
Andreas Beckmann [Fri, 26 Nov 2021 13:22:35 +0000 (14:22 +0100)]
[PATCH 74/90] pthread: cleanup/fail pthread_scheduler_init() if any worker is out-of-memory
fixes: #1009
Gbp-Pq: Name 0074-pthread-cleanup-fail-pthread_scheduler_init-if-any-w.patch
Andreas Beckmann [Fri, 26 Nov 2021 13:11:30 +0000 (14:11 +0100)]
[PATCH 73/90] pthread: add barrier after worker thread initialization
Gbp-Pq: Name 0073-pthread-add-barrier-after-worker-thread-initializati.patch
Andreas Beckmann [Fri, 26 Nov 2021 13:02:56 +0000 (14:02 +0100)]
[PATCH 72/90] pthread: add return code to pthread_scheduler_init()
Gbp-Pq: Name 0072-pthread-add-return-code-to-pthread_scheduler_init.patch
Andreas Beckmann [Thu, 2 Dec 2021 01:34:09 +0000 (02:34 +0100)]
[PATCH 71/90] add PTHREAD_CHECK() to all macros wrapping pthread_*() calls
Gbp-Pq: Name 0071-add-PTHREAD_CHECK-to-all-macros-wrapping-pthread_-ca.patch
Andreas Beckmann [Fri, 3 Dec 2021 15:14:53 +0000 (16:14 +0100)]
[PATCH 69/90] add PTHREAD_CHECK2() accepting an additional non-zero return code
use e.g. for
* pthread_broadcast(): PTHREAD_BARRIER_SERIAL_THREAD
* pthread_cond_timedwait(): ETIMEDOUT
avoid the extra function call to the error handler if the command succeeded
Gbp-Pq: Name 0069-add-PTHREAD_CHECK2-accepting-an-additional-non-zero-.patch
Andreas Beckmann [Thu, 2 Dec 2021 00:53:31 +0000 (01:53 +0100)]
[PATCH 68/90] move PTHREAD_CHECK() to pocl_cl.h
Gbp-Pq: Name 0068-move-PTHREAD_CHECK-to-pocl_cl.h.patch
Andreas Beckmann [Wed, 1 Dec 2021 22:39:45 +0000 (23:39 +0100)]
[PATCH 66/90] pthread: add PTHREAD_CHECK() to all pthread_*() calls
Gbp-Pq: Name 0066-pthread-add-PTHREAD_CHECK-to-all-pthread_-calls.patch
Andreas Beckmann [Wed, 1 Dec 2021 22:16:14 +0000 (23:16 +0100)]
[PATCH 64/90] cuda: add PTHREAD_CHECK() to all pthread_*() calls
Gbp-Pq: Name 0064-cuda-add-PTHREAD_CHECK-to-all-pthread_-calls.patch
Andreas Beckmann [Wed, 1 Dec 2021 21:50:25 +0000 (22:50 +0100)]
[PATCH 63/90] hsa: add PTHREAD_CHECK() to the remaining pthread_*() calls
Gbp-Pq: Name 0063-hsa-add-PTHREAD_CHECK-to-the-remaining-pthread_-call.patch