Warning: Permanently added '44.222.134.9' (ED25519) to the list of known hosts. You can reproduce this build on your computer by running: sudo dnf install copr-rpmbuild /usr/bin/copr-rpmbuild --verbose --drop-resultdir --task-url https://copr.fedorainfracloud.org/backend/get-build-task/9938614-fedora-rawhide-x86_64 --chroot fedora-rawhide-x86_64 Version: 1.6 PID: 8701 Logging PID: 8703 Task: {'allow_user_ssh': False, 'appstream': False, 'background': True, 'build_id': 9938614, 'buildroot_pkgs': [], 'chroot': 'fedora-rawhide-x86_64', 'enable_net': False, 'fedora_review': False, 'git_hash': '2fd4146d4a1bdeb6a90f318d3b940f9b3f44191e', 'git_repo': 'https://copr-dist-git.fedorainfracloud.org/git/@rocm-packagers-sig/RH/llama-cpp', 'isolation': 'default', 'memory_reqs': 2048, 'package_name': 'llama-cpp', 'package_version': 'b6153-1', 'project_dirname': 'RH', 'project_name': 'RH', 'project_owner': '@rocm-packagers-sig', 'repo_priority': None, 'repos': [{'baseurl': 'https://download.copr.fedorainfracloud.org/results/@rocm-packagers-sig/RH/fedora-rawhide-x86_64/', 'id': 'copr_base', 'name': 'Copr repository', 'priority': None}], 'sandbox': '@rocm-packagers-sig/RH--https://src.fedoraproject.org/user/trix', 'source_json': {}, 'source_type': None, 'ssh_public_keys': None, 'storage': 0, 'submitter': 'https://src.fedoraproject.org/user/trix', 'tags': [], 'task_id': '9938614-fedora-rawhide-x86_64', 'timeout': 18000, 'uses_devel_repo': False, 'with_opts': [], 'without_opts': []} Running: git clone https://copr-dist-git.fedorainfracloud.org/git/@rocm-packagers-sig/RH/llama-cpp /var/lib/copr-rpmbuild/workspace/workdir-uxi5ap5m/llama-cpp --depth 500 --no-single-branch --recursive cmd: ['git', 'clone', 'https://copr-dist-git.fedorainfracloud.org/git/@rocm-packagers-sig/RH/llama-cpp', '/var/lib/copr-rpmbuild/workspace/workdir-uxi5ap5m/llama-cpp', '--depth', '500', '--no-single-branch', '--recursive'] cwd: . rc: 0 stdout: stderr: Cloning into '/var/lib/copr-rpmbuild/workspace/workdir-uxi5ap5m/llama-cpp'... Running: git checkout 2fd4146d4a1bdeb6a90f318d3b940f9b3f44191e -- cmd: ['git', 'checkout', '2fd4146d4a1bdeb6a90f318d3b940f9b3f44191e', '--'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-uxi5ap5m/llama-cpp rc: 0 stdout: stderr: Note: switching to '2fd4146d4a1bdeb6a90f318d3b940f9b3f44191e'. You are in 'detached HEAD' state. You can look around, make experimental changes and commit them, and you can discard any commits you make in this state without impacting any branches by switching back to a branch. If you want to create a new branch to retain commits you create, you may do so (now or later) by using -c with the switch command. Example: git switch -c Or undo this operation with: git switch - Turn off this advice by setting config variable advice.detachedHead to false HEAD is now at 2fd4146 automatic import of llama-cpp Running: dist-git-client sources cmd: ['dist-git-client', 'sources'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-uxi5ap5m/llama-cpp rc: 0 stdout: stderr: INFO: Reading stdout from command: git rev-parse --abbrev-ref HEAD INFO: Reading stdout from command: git rev-parse HEAD INFO: Reading sources specification file: sources INFO: Downloading llama.cpp-b6153.tar.gz INFO: Reading stdout from command: curl --help all INFO: Calling: curl -H Pragma: -o llama.cpp-b6153.tar.gz --location --connect-timeout 60 --retry 3 --retry-delay 10 --remote-time --show-error --fail --retry-all-errors https://copr-dist-git.fedorainfracloud.org/repo/pkgs/@rocm-packagers-sig/RH/llama-cpp/llama.cpp-b6153.tar.gz/md5/e7eae951975b13b8eed5bb4264c632cc/llama.cpp-b6153.tar.gz % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 24.3M 100 24.3M 0 0 380M 0 --:--:-- --:--:-- --:--:-- 385M INFO: Reading stdout from command: md5sum llama.cpp-b6153.tar.gz tail: /var/lib/copr-rpmbuild/main.log: file truncated Running (timeout=18000): unbuffer mock --spec /var/lib/copr-rpmbuild/workspace/workdir-uxi5ap5m/llama-cpp/llama-cpp.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-uxi5ap5m/llama-cpp --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1766268696.323769 -r /var/lib/copr-rpmbuild/results/configs/child.cfg INFO: mock.py version 6.6 starting (python version = 3.13.7, NVR = mock-6.6-1.fc42), args: /usr/libexec/mock/mock --spec /var/lib/copr-rpmbuild/workspace/workdir-uxi5ap5m/llama-cpp/llama-cpp.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-uxi5ap5m/llama-cpp --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1766268696.323769 -r /var/lib/copr-rpmbuild/results/configs/child.cfg Start(bootstrap): init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish(bootstrap): init plugins Start: init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish: init plugins INFO: Signal handler active Start: run INFO: Start(/var/lib/copr-rpmbuild/workspace/workdir-uxi5ap5m/llama-cpp/llama-cpp.spec) Config(fedora-rawhide-x86_64) Start: clean chroot Finish: clean chroot Mock Version: 6.6 INFO: Mock Version: 6.6 Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1766268696.323769/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata INFO: Guessed host environment type: unknown INFO: Using container image: registry.fedoraproject.org/fedora:rawhide INFO: Pulling image: registry.fedoraproject.org/fedora:rawhide INFO: Tagging container image as mock-bootstrap-de6ef8e5-3926-416b-bc75-2175fa8cb946 INFO: Checking that 72235d1638968c9fc35bb27acfa01c9bae36f6fc6444a685be8a246577921c91 image matches host's architecture INFO: Copy content of container 72235d1638968c9fc35bb27acfa01c9bae36f6fc6444a685be8a246577921c91 to /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1766268696.323769/root INFO: mounting 72235d1638968c9fc35bb27acfa01c9bae36f6fc6444a685be8a246577921c91 with podman image mount INFO: image 72235d1638968c9fc35bb27acfa01c9bae36f6fc6444a685be8a246577921c91 as /var/lib/containers/storage/overlay/76d8c395a07fe7a35ae732f946c798bf8c5cd75f124c5f789c490693c3e00c1f/merged INFO: umounting image 72235d1638968c9fc35bb27acfa01c9bae36f6fc6444a685be8a246577921c91 (/var/lib/containers/storage/overlay/76d8c395a07fe7a35ae732f946c798bf8c5cd75f124c5f789c490693c3e00c1f/merged) with podman image umount INFO: Removing image mock-bootstrap-de6ef8e5-3926-416b-bc75-2175fa8cb946 INFO: Package manager dnf5 detected and used (fallback) INFO: Not updating bootstrap chroot, bootstrap_image_ready=True Start(bootstrap): creating root cache Finish(bootstrap): creating root cache Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-1766268696.323769/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Package manager dnf5 detected and used (direct choice) INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-6.0.1-1.fc44.x86_64 rpm-sequoia-1.10.0-1.fc44.x86_64 dnf5-5.3.0.0-3.fc44.x86_64 dnf5-plugins-5.3.0.0-3.fc44.x86_64 Start: installing minimal buildroot with dnf5 Updating and loading repositories: Copr repository 100% | 316.9 KiB/s | 234.5 KiB | 00m01s fedora 100% | 8.7 MiB/s | 21.8 MiB | 00m02s Repositories loaded. Package Arch Version Repository Size Installing group/module packages: bash x86_64 0:5.3.0-2.fc43 fedora 8.4 MiB bzip2 x86_64 0:1.0.8-21.fc43 fedora 95.3 KiB coreutils x86_64 0:9.9-1.fc44 fedora 5.4 MiB cpio x86_64 0:2.15-6.fc43 fedora 1.1 MiB diffutils x86_64 0:3.12-3.fc43 fedora 1.6 MiB fedora-release-common noarch 0:44-0.10 fedora 20.6 KiB findutils x86_64 1:4.10.0-6.fc43 fedora 1.8 MiB gawk x86_64 0:5.3.2-2.fc43 fedora 1.8 MiB glibc-minimal-langpack x86_64 0:2.42.9000-16.fc44 fedora 0.0 B grep x86_64 0:3.12-2.fc43 fedora 1.0 MiB gzip x86_64 0:1.14-1.fc44 fedora 397.8 KiB info x86_64 0:7.2-7.fc44 fedora 357.9 KiB patch x86_64 0:2.8-3.fc44 fedora 226.6 KiB redhat-rpm-config noarch 0:343-18.fc44 fedora 183.6 KiB rpm-build x86_64 0:6.0.1-1.fc44 fedora 287.5 KiB sed x86_64 0:4.9-6.fc44 fedora 857.3 KiB shadow-utils x86_64 2:4.18.0-7.fc44 fedora 3.9 MiB tar x86_64 2:1.35-6.fc43 fedora 2.9 MiB unzip x86_64 0:6.0-68.fc44 fedora 390.3 KiB util-linux x86_64 0:2.41.3-8.fc44 fedora 3.5 MiB which x86_64 0:2.23-3.fc43 fedora 83.5 KiB xz x86_64 1:5.8.1-4.fc44 fedora 1.3 MiB Installing dependencies: R-srpm-macros noarch 0:1.3.0-1.fc44 fedora 3.2 KiB add-determinism x86_64 0:0.7.2-2.fc44 fedora 2.3 MiB alternatives x86_64 0:1.33-3.fc44 fedora 62.2 KiB ansible-srpm-macros noarch 0:1-18.1.fc43 fedora 35.7 KiB audit-libs x86_64 0:4.1.2-2.fc44 fedora 378.8 KiB binutils x86_64 0:2.45.50-12.fc44 copr_base 27.0 MiB build-reproducibility-srpm-macros noarch 0:0.7.2-2.fc44 fedora 1.2 KiB bzip2-libs x86_64 0:1.0.8-21.fc43 fedora 80.6 KiB ca-certificates noarch 0:2025.2.80_v9.0.304-2.fc44 fedora 2.7 MiB cmake-srpm-macros noarch 0:3.31.10-3.fc44 fedora 524.0 B coreutils-common x86_64 0:9.9-1.fc44 fedora 11.2 MiB crypto-policies noarch 0:20251128-1.git19878fe.fc44 fedora 132.6 KiB curl x86_64 0:8.18.0~rc2-1.fc44 fedora 471.5 KiB cyrus-sasl-lib x86_64 0:2.1.28-33.fc44 fedora 2.3 MiB debugedit x86_64 0:5.2-3.fc44 fedora 214.0 KiB dwz x86_64 0:0.16-2.fc43 fedora 287.1 KiB ed x86_64 0:1.22.3-1.fc44 fedora 148.9 KiB efi-srpm-macros noarch 0:6-5.fc44 fedora 40.2 KiB elfutils x86_64 0:0.194-2.fc44 fedora 2.9 MiB elfutils-debuginfod-client x86_64 0:0.194-2.fc44 fedora 84.0 KiB elfutils-default-yama-scope noarch 0:0.194-2.fc44 fedora 1.8 KiB elfutils-libelf x86_64 0:0.194-2.fc44 fedora 1.1 MiB elfutils-libs x86_64 0:0.194-2.fc44 fedora 687.5 KiB fedora-gpg-keys noarch 0:44-0.1 fedora 131.2 KiB fedora-release noarch 0:44-0.10 fedora 0.0 B fedora-release-identity-basic noarch 0:44-0.10 fedora 664.0 B fedora-repos noarch 0:44-0.1 fedora 4.9 KiB fedora-repos-rawhide noarch 0:44-0.1 fedora 2.2 KiB file x86_64 0:5.46-8.fc44 fedora 100.2 KiB file-libs x86_64 0:5.46-8.fc44 fedora 11.9 MiB filesystem x86_64 0:3.18-50.fc43 fedora 112.0 B filesystem-srpm-macros noarch 0:3.18-50.fc43 fedora 38.2 KiB fonts-srpm-macros noarch 1:5.0.0-1.fc44 fedora 55.8 KiB forge-srpm-macros noarch 0:0.4.0-3.fc43 fedora 38.9 KiB fpc-srpm-macros noarch 0:1.3-15.fc43 fedora 144.0 B gap-srpm-macros noarch 0:2-1.fc44 fedora 2.1 KiB gdb-minimal x86_64 0:16.3-6.fc44 fedora 13.3 MiB gdbm-libs x86_64 1:1.23-10.fc43 fedora 129.9 KiB ghc-srpm-macros noarch 0:1.9.2-3.fc43 fedora 779.0 B glibc x86_64 0:2.42.9000-16.fc44 fedora 6.8 MiB glibc-common x86_64 0:2.42.9000-16.fc44 fedora 1.0 MiB glibc-gconv-extra x86_64 0:2.42.9000-16.fc44 fedora 7.2 MiB gmp x86_64 1:6.3.0-4.fc44 fedora 815.3 KiB gnat-srpm-macros noarch 0:6-8.fc43 fedora 1.0 KiB gnulib-l10n noarch 0:20241231-1.fc44 fedora 655.0 KiB gnupg2 x86_64 0:2.4.8-4.fc43 fedora 6.5 MiB gnupg2-dirmngr x86_64 0:2.4.8-4.fc43 fedora 618.4 KiB gnupg2-gpg-agent x86_64 0:2.4.8-4.fc43 fedora 671.4 KiB gnupg2-gpgconf x86_64 0:2.4.8-4.fc43 fedora 250.0 KiB gnupg2-keyboxd x86_64 0:2.4.8-4.fc43 fedora 201.4 KiB gnupg2-verify x86_64 0:2.4.8-4.fc43 fedora 348.5 KiB gnutls x86_64 0:3.8.11-6.fc44 fedora 3.6 MiB go-srpm-macros noarch 0:3.8.0-1.fc44 fedora 61.9 KiB gpgverify noarch 0:2.2-3.fc43 fedora 8.7 KiB ima-evm-utils-libs x86_64 0:1.6.2-7.fc44 fedora 60.7 KiB jansson x86_64 0:2.14-3.fc43 fedora 89.1 KiB java-srpm-macros noarch 0:1-7.fc43 fedora 870.0 B json-c x86_64 0:0.18-7.fc43 fedora 82.7 KiB kernel-srpm-macros noarch 0:1.0-27.fc43 fedora 1.9 KiB keyutils-libs x86_64 0:1.6.3-6.fc43 fedora 54.3 KiB krb5-libs x86_64 0:1.21.3-10.fc44 fedora 2.3 MiB libacl x86_64 0:2.3.2-4.fc43 fedora 35.9 KiB libarchive x86_64 0:3.8.4-1.fc44 fedora 955.3 KiB libassuan x86_64 0:2.5.7-4.fc43 fedora 163.8 KiB libatomic x86_64 0:16.0.0-0.2.fc44 copr_base 36.7 KiB libattr x86_64 0:2.5.2-6.fc43 fedora 24.4 KiB libblkid x86_64 0:2.41.3-8.fc44 fedora 262.3 KiB libbrotli x86_64 0:1.2.0-1.fc44 fedora 865.4 KiB libcap x86_64 0:2.77-1.fc44 fedora 209.1 KiB libcap-ng x86_64 0:0.8.5-8.fc44 fedora 68.9 KiB libcom_err x86_64 0:1.47.3-3.fc44 fedora 63.1 KiB libcurl x86_64 0:8.18.0~rc2-1.fc44 fedora 984.7 KiB libeconf x86_64 0:0.7.9-2.fc43 fedora 64.9 KiB libevent x86_64 0:2.1.12-16.fc43 fedora 883.1 KiB libfdisk x86_64 0:2.41.3-8.fc44 fedora 380.3 KiB libffi x86_64 0:3.5.2-1.fc44 fedora 83.8 KiB libfsverity x86_64 0:1.6-3.fc43 fedora 28.5 KiB libgcc x86_64 0:16.0.0-0.2.fc44 copr_base 270.7 KiB libgcrypt x86_64 0:1.11.2-1.fc44 fedora 1.6 MiB libgomp x86_64 0:16.0.0-0.2.fc44 copr_base 570.9 KiB libgpg-error x86_64 0:1.58-1.fc44 fedora 941.6 KiB libidn2 x86_64 0:2.3.8-2.fc43 fedora 552.5 KiB libksba x86_64 0:1.6.7-4.fc43 fedora 398.5 KiB liblastlog2 x86_64 0:2.41.3-8.fc44 fedora 33.6 KiB libmount x86_64 0:2.41.3-8.fc44 fedora 372.6 KiB libnghttp2 x86_64 0:1.68.0-2.fc44 fedora 162.2 KiB libnghttp3 x86_64 0:1.13.1-1.fc44 fedora 155.3 KiB libpkgconf x86_64 0:2.3.0-3.fc43 fedora 78.1 KiB libpsl x86_64 0:0.21.5-6.fc43 fedora 76.4 KiB libselinux x86_64 0:3.9-5.fc44 fedora 193.1 KiB libselinux-utils x86_64 0:3.9-5.fc44 fedora 309.0 KiB libsemanage x86_64 0:3.9-4.fc44 fedora 308.5 KiB libsepol x86_64 0:3.9-2.fc43 fedora 822.0 KiB libsmartcols x86_64 0:2.41.3-8.fc44 fedora 180.3 KiB libssh x86_64 0:0.11.3-1.fc44 fedora 567.1 KiB libssh-config noarch 0:0.11.3-1.fc44 fedora 277.0 B libstdc++ x86_64 0:16.0.0-0.2.fc44 copr_base 2.9 MiB libtasn1 x86_64 0:4.20.0-2.fc43 fedora 176.3 KiB libtool-ltdl x86_64 0:2.5.4-8.fc44 fedora 70.1 KiB libunistring x86_64 0:1.1-10.fc43 fedora 1.7 MiB libusb1 x86_64 0:1.0.29-4.fc44 fedora 171.3 KiB libuuid x86_64 0:2.41.3-8.fc44 fedora 37.2 KiB libverto x86_64 0:0.3.2-11.fc43 fedora 25.4 KiB libxcrypt x86_64 0:4.5.2-2.fc44 fedora 285.3 KiB libxml2 x86_64 0:2.12.10-5.fc44 fedora 1.7 MiB libzstd x86_64 0:1.5.7-3.fc44 fedora 940.3 KiB linkdupes x86_64 0:0.7.2-2.fc44 fedora 838.7 KiB lua-libs x86_64 0:5.4.8-4.fc44 fedora 281.9 KiB lua-srpm-macros noarch 0:1-16.fc43 fedora 1.3 KiB lz4-libs x86_64 0:1.10.0-3.fc43 fedora 161.4 KiB mpfr x86_64 0:4.2.2-2.fc43 fedora 832.8 KiB ncurses-base noarch 0:6.5-8.20250614.fc44 fedora 328.1 KiB ncurses-libs x86_64 0:6.5-8.20250614.fc44 fedora 946.4 KiB nettle x86_64 0:3.10.1-2.fc43 fedora 790.6 KiB ngtcp2 x86_64 0:1.18.0-1.fc44 fedora 314.3 KiB ngtcp2-crypto-ossl x86_64 0:1.18.0-1.fc44 fedora 51.7 KiB npth x86_64 0:1.8-3.fc43 fedora 49.6 KiB ocaml-srpm-macros noarch 0:11-2.fc43 fedora 1.9 KiB openblas-srpm-macros noarch 0:2-20.fc43 fedora 112.0 B openldap x86_64 0:2.6.10-4.fc44 fedora 659.8 KiB openssl-libs x86_64 1:3.5.4-1.fc44 fedora 8.9 MiB p11-kit x86_64 0:0.25.8-1.fc44 fedora 2.3 MiB p11-kit-trust x86_64 0:0.25.8-1.fc44 fedora 446.5 KiB package-notes-srpm-macros noarch 0:0.5-14.fc43 fedora 1.6 KiB pam-libs x86_64 0:1.7.1-3.fc43 fedora 126.8 KiB pcre2 x86_64 0:10.47-1.fc44 fedora 702.6 KiB pcre2-syntax noarch 0:10.47-1.fc44 fedora 281.9 KiB perl-srpm-macros noarch 0:1-60.fc43 fedora 861.0 B pkgconf x86_64 0:2.3.0-3.fc43 fedora 88.5 KiB pkgconf-m4 noarch 0:2.3.0-3.fc43 fedora 14.4 KiB pkgconf-pkg-config x86_64 0:2.3.0-3.fc43 fedora 989.0 B policycoreutils x86_64 0:3.9-5.fc44 fedora 683.5 KiB popt x86_64 0:1.19-9.fc43 fedora 132.8 KiB publicsuffix-list-dafsa noarch 0:20250616-2.fc43 fedora 69.1 KiB pyproject-srpm-macros noarch 0:1.18.6-1.fc44 fedora 1.9 KiB python-srpm-macros noarch 0:3.14-9.fc44 fedora 51.6 KiB qt5-srpm-macros noarch 0:5.15.18-1.fc44 fedora 500.0 B qt6-srpm-macros noarch 0:6.10.1-1.fc44 fedora 464.0 B readline x86_64 0:8.3-2.fc43 fedora 511.7 KiB rpm x86_64 0:6.0.1-1.fc44 fedora 3.1 MiB rpm-build-libs x86_64 0:6.0.1-1.fc44 fedora 264.4 KiB rpm-libs x86_64 0:6.0.1-1.fc44 fedora 933.8 KiB rpm-plugin-selinux x86_64 0:6.0.1-1.fc44 fedora 12.0 KiB rpm-sequoia x86_64 0:1.10.0-1.fc44 fedora 2.5 MiB rpm-sign-libs x86_64 0:6.0.1-1.fc44 fedora 39.7 KiB rust-srpm-macros noarch 0:28.4-1.fc44 fedora 5.5 KiB selinux-policy noarch 0:42.19-1.fc44 fedora 32.0 KiB selinux-policy-targeted noarch 0:42.19-1.fc44 fedora 18.7 MiB setup noarch 0:2.15.0-27.fc44 fedora 724.9 KiB sqlite-libs x86_64 0:3.51.0-1.fc44 fedora 1.5 MiB systemd-libs x86_64 0:259-1.fc44 fedora 2.3 MiB systemd-standalone-sysusers x86_64 0:259-1.fc44 fedora 293.5 KiB tpm2-tss x86_64 0:4.1.3-8.fc43 fedora 1.6 MiB tree-sitter-srpm-macros noarch 0:0.4.2-1.fc43 fedora 8.3 KiB util-linux-core x86_64 0:2.41.3-8.fc44 fedora 1.5 MiB xxhash-libs x86_64 0:0.8.3-3.fc43 fedora 90.2 KiB xz-libs x86_64 1:5.8.1-4.fc44 fedora 217.8 KiB zig-srpm-macros noarch 0:1-5.fc43 fedora 1.1 KiB zip x86_64 0:3.0-44.fc43 fedora 694.5 KiB zlib-ng-compat x86_64 0:2.3.2-2.fc44 fedora 161.5 KiB zstd x86_64 0:1.5.7-3.fc44 fedora 506.2 KiB Installing groups: Buildsystem building group Transaction Summary: Installing: 183 packages Total size of inbound packages is 67 MiB. Need to download 67 MiB. After this operation, 219 MiB extra will be used (install 219 MiB, remove 0 B). [ 1/183] bzip2-0:1.0.8-21.fc43.x86_64 100% | 189.8 KiB/s | 51.6 KiB | 00m00s [ 2/183] cpio-0:2.15-6.fc43.x86_64 100% | 1.1 MiB/s | 293.1 KiB | 00m00s [ 3/183] coreutils-0:9.9-1.fc44.x86_64 100% | 1.7 MiB/s | 1.2 MiB | 00m01s [ 4/183] diffutils-0:3.12-3.fc43.x86_6 100% | 2.0 MiB/s | 392.3 KiB | 00m00s [ 5/183] bash-0:5.3.0-2.fc43.x86_64 100% | 2.5 MiB/s | 1.9 MiB | 00m01s [ 6/183] fedora-release-common-0:44-0. 100% | 391.5 KiB/s | 24.7 KiB | 00m00s [ 7/183] glibc-minimal-langpack-0:2.42 100% | 1.0 MiB/s | 70.5 KiB | 00m00s [ 8/183] grep-0:3.12-2.fc43.x86_64 100% | 3.6 MiB/s | 299.1 KiB | 00m00s [ 9/183] findutils-1:4.10.0-6.fc43.x86 100% | 3.6 MiB/s | 550.0 KiB | 00m00s [ 10/183] gzip-0:1.14-1.fc44.x86_64 100% | 2.5 MiB/s | 177.7 KiB | 00m00s [ 11/183] info-0:7.2-7.fc44.x86_64 100% | 2.4 MiB/s | 182.9 KiB | 00m00s [ 12/183] patch-0:2.8-3.fc44.x86_64 100% | 1.6 MiB/s | 113.9 KiB | 00m00s [ 13/183] redhat-rpm-config-0:343-18.fc 100% | 1.2 MiB/s | 79.4 KiB | 00m00s [ 14/183] rpm-build-0:6.0.1-1.fc44.x86_ 100% | 1.9 MiB/s | 137.9 KiB | 00m00s [ 15/183] sed-0:4.9-6.fc44.x86_64 100% | 3.7 MiB/s | 317.1 KiB | 00m00s [ 16/183] shadow-utils-2:4.18.0-7.fc44. 100% | 9.6 MiB/s | 1.3 MiB | 00m00s [ 17/183] unzip-0:6.0-68.fc44.x86_64 100% | 2.4 MiB/s | 184.6 KiB | 00m00s [ 18/183] tar-2:1.35-6.fc43.x86_64 100% | 6.0 MiB/s | 856.4 KiB | 00m00s [ 19/183] which-0:2.23-3.fc43.x86_64 100% | 642.0 KiB/s | 41.7 KiB | 00m00s [ 20/183] xz-1:5.8.1-4.fc44.x86_64 100% | 5.6 MiB/s | 572.9 KiB | 00m00s [ 21/183] util-linux-0:2.41.3-8.fc44.x8 100% | 13.6 MiB/s | 1.2 MiB | 00m00s [ 22/183] gawk-0:5.3.2-2.fc43.x86_64 100% | 8.7 MiB/s | 1.1 MiB | 00m00s [ 23/183] glibc-0:2.42.9000-16.fc44.x86 100% | 21.1 MiB/s | 2.3 MiB | 00m00s [ 24/183] ncurses-libs-0:6.5-8.20250614 100% | 2.6 MiB/s | 333.1 KiB | 00m00s [ 25/183] filesystem-0:3.18-50.fc43.x86 100% | 7.3 MiB/s | 1.3 MiB | 00m00s [ 26/183] bzip2-libs-0:1.0.8-21.fc43.x8 100% | 662.5 KiB/s | 43.1 KiB | 00m00s [ 27/183] gmp-1:6.3.0-4.fc44.x86_64 100% | 4.2 MiB/s | 319.3 KiB | 00m00s [ 28/183] libacl-0:2.3.2-4.fc43.x86_64 100% | 373.5 KiB/s | 24.3 KiB | 00m00s [ 29/183] libattr-0:2.5.2-6.fc43.x86_64 100% | 274.7 KiB/s | 17.9 KiB | 00m00s [ 30/183] libcap-0:2.77-1.fc44.x86_64 100% | 1.3 MiB/s | 87.1 KiB | 00m00s [ 31/183] coreutils-common-0:9.9-1.fc44 100% | 11.0 MiB/s | 2.1 MiB | 00m00s [ 32/183] libselinux-0:3.9-5.fc44.x86_6 100% | 1.4 MiB/s | 97.8 KiB | 00m00s [ 33/183] systemd-libs-0:259-1.fc44.x86 100% | 11.2 MiB/s | 822.5 KiB | 00m00s [ 34/183] fedora-repos-0:44-0.1.noarch 100% | 139.6 KiB/s | 9.1 KiB | 00m00s [ 35/183] openssl-libs-1:3.5.4-1.fc44.x 100% | 19.7 MiB/s | 2.6 MiB | 00m00s [ 36/183] glibc-common-0:2.42.9000-16.f 100% | 5.2 MiB/s | 358.3 KiB | 00m00s [ 37/183] pcre2-0:10.47-1.fc44.x86_64 100% | 3.6 MiB/s | 267.2 KiB | 00m00s [ 38/183] ed-0:1.22.3-1.fc44.x86_64 100% | 1.3 MiB/s | 84.1 KiB | 00m00s [ 39/183] R-srpm-macros-0:1.3.0-1.fc44. 100% | 162.9 KiB/s | 10.3 KiB | 00m00s [ 40/183] ansible-srpm-macros-0:1-18.1. 100% | 306.3 KiB/s | 19.9 KiB | 00m00s [ 41/183] build-reproducibility-srpm-ma 100% | 197.8 KiB/s | 12.9 KiB | 00m00s [ 42/183] cmake-srpm-macros-0:3.31.10-3 100% | 164.3 KiB/s | 10.4 KiB | 00m00s [ 43/183] dwz-0:0.16-2.fc43.x86_64 100% | 1.9 MiB/s | 135.5 KiB | 00m00s [ 44/183] efi-srpm-macros-0:6-5.fc44.no 100% | 346.6 KiB/s | 22.5 KiB | 00m00s [ 45/183] file-0:5.46-8.fc44.x86_64 100% | 774.7 KiB/s | 48.8 KiB | 00m00s [ 46/183] filesystem-srpm-macros-0:3.18 100% | 406.4 KiB/s | 26.4 KiB | 00m00s [ 47/183] fonts-srpm-macros-1:5.0.0-1.f 100% | 419.8 KiB/s | 27.3 KiB | 00m00s [ 48/183] forge-srpm-macros-0:0.4.0-3.f 100% | 318.9 KiB/s | 20.1 KiB | 00m00s [ 49/183] fpc-srpm-macros-0:1.3-15.fc43 100% | 121.4 KiB/s | 7.9 KiB | 00m00s [ 50/183] gap-srpm-macros-0:2-1.fc44.no 100% | 139.3 KiB/s | 9.1 KiB | 00m00s [ 51/183] ghc-srpm-macros-0:1.9.2-3.fc4 100% | 138.8 KiB/s | 8.7 KiB | 00m00s [ 52/183] gnat-srpm-macros-0:6-8.fc43.n 100% | 130.6 KiB/s | 8.5 KiB | 00m00s [ 53/183] go-srpm-macros-0:3.8.0-1.fc44 100% | 435.5 KiB/s | 28.3 KiB | 00m00s [ 54/183] java-srpm-macros-0:1-7.fc43.n 100% | 126.1 KiB/s | 7.9 KiB | 00m00s [ 55/183] kernel-srpm-macros-0:1.0-27.f 100% | 137.2 KiB/s | 8.9 KiB | 00m00s [ 56/183] lua-srpm-macros-0:1-16.fc43.n 100% | 134.7 KiB/s | 8.8 KiB | 00m00s [ 57/183] ocaml-srpm-macros-0:11-2.fc43 100% | 147.0 KiB/s | 9.3 KiB | 00m00s [ 58/183] openblas-srpm-macros-0:2-20.f 100% | 116.8 KiB/s | 7.6 KiB | 00m00s [ 59/183] package-notes-srpm-macros-0:0 100% | 138.2 KiB/s | 9.0 KiB | 00m00s [ 60/183] perl-srpm-macros-0:1-60.fc43. 100% | 131.6 KiB/s | 8.3 KiB | 00m00s [ 61/183] pyproject-srpm-macros-0:1.18. 100% | 204.8 KiB/s | 13.3 KiB | 00m00s [ 62/183] python-srpm-macros-0:3.14-9.f 100% | 366.3 KiB/s | 23.8 KiB | 00m00s [ 63/183] qt5-srpm-macros-0:5.15.18-1.f 100% | 136.5 KiB/s | 8.6 KiB | 00m00s [ 64/183] qt6-srpm-macros-0:6.10.1-1.fc 100% | 144.0 KiB/s | 9.4 KiB | 00m00s [ 65/183] rpm-0:6.0.1-1.fc44.x86_64 100% | 8.1 MiB/s | 577.6 KiB | 00m00s [ 66/183] rust-srpm-macros-0:28.4-1.fc4 100% | 172.8 KiB/s | 10.9 KiB | 00m00s [ 67/183] tree-sitter-srpm-macros-0:0.4 100% | 205.4 KiB/s | 13.4 KiB | 00m00s [ 68/183] zig-srpm-macros-0:1-5.fc43.no 100% | 129.8 KiB/s | 8.4 KiB | 00m00s [ 69/183] zip-0:3.0-44.fc43.x86_64 100% | 3.9 MiB/s | 261.6 KiB | 00m00s [ 70/183] debugedit-0:5.2-3.fc44.x86_64 100% | 1.3 MiB/s | 85.6 KiB | 00m00s [ 71/183] elfutils-0:0.194-2.fc44.x86_6 100% | 8.0 MiB/s | 574.6 KiB | 00m00s [ 72/183] elfutils-libelf-0:0.194-2.fc4 100% | 3.1 MiB/s | 204.7 KiB | 00m00s [ 73/183] libarchive-0:3.8.4-1.fc44.x86 100% | 5.4 MiB/s | 422.8 KiB | 00m00s [ 74/183] popt-0:1.19-9.fc43.x86_64 100% | 1.0 MiB/s | 65.7 KiB | 00m00s [ 75/183] readline-0:8.3-2.fc43.x86_64 100% | 3.4 MiB/s | 224.6 KiB | 00m00s [ 76/183] rpm-build-libs-0:6.0.1-1.fc44 100% | 1.8 MiB/s | 126.9 KiB | 00m00s [ 77/183] zstd-0:1.5.7-3.fc44.x86_64 100% | 2.9 MiB/s | 189.5 KiB | 00m00s [ 78/183] rpm-libs-0:6.0.1-1.fc44.x86_6 100% | 5.8 MiB/s | 401.1 KiB | 00m00s [ 79/183] audit-libs-0:4.1.2-2.fc44.x86 100% | 2.0 MiB/s | 138.4 KiB | 00m00s [ 80/183] libeconf-0:0.7.9-2.fc43.x86_6 100% | 558.9 KiB/s | 35.2 KiB | 00m00s [ 81/183] libsemanage-0:3.9-4.fc44.x86_ 100% | 1.9 MiB/s | 123.5 KiB | 00m00s [ 82/183] libxcrypt-0:4.5.2-2.fc44.x86_ 100% | 1.8 MiB/s | 128.2 KiB | 00m00s [ 83/183] pam-libs-0:1.7.1-3.fc43.x86_6 100% | 913.1 KiB/s | 57.5 KiB | 00m00s [ 84/183] setup-0:2.15.0-27.fc44.noarch 100% | 2.3 MiB/s | 157.4 KiB | 00m00s [ 85/183] mpfr-0:4.2.2-2.fc43.x86_64 100% | 5.1 MiB/s | 347.0 KiB | 00m00s [ 86/183] xz-libs-1:5.8.1-4.fc44.x86_64 100% | 1.6 MiB/s | 112.8 KiB | 00m00s [ 87/183] libblkid-0:2.41.3-8.fc44.x86_ 100% | 1.8 MiB/s | 122.8 KiB | 00m00s [ 88/183] libfdisk-0:2.41.3-8.fc44.x86_ 100% | 2.5 MiB/s | 161.4 KiB | 00m00s [ 89/183] libcap-ng-0:0.8.5-8.fc44.x86_ 100% | 495.3 KiB/s | 32.2 KiB | 00m00s [ 90/183] liblastlog2-0:2.41.3-8.fc44.x 100% | 351.6 KiB/s | 22.9 KiB | 00m00s [ 91/183] libmount-0:2.41.3-8.fc44.x86_ 100% | 2.5 MiB/s | 162.1 KiB | 00m00s [ 92/183] libsmartcols-0:2.41.3-8.fc44. 100% | 1.2 MiB/s | 83.5 KiB | 00m00s [ 93/183] libuuid-0:2.41.3-8.fc44.x86_6 100% | 397.0 KiB/s | 25.8 KiB | 00m00s [ 94/183] util-linux-core-0:2.41.3-8.fc 100% | 7.8 MiB/s | 550.3 KiB | 00m00s [ 95/183] zlib-ng-compat-0:2.3.2-2.fc44 100% | 1.3 MiB/s | 88.9 KiB | 00m00s [ 96/183] glibc-gconv-extra-0:2.42.9000 100% | 20.1 MiB/s | 1.6 MiB | 00m00s [ 97/183] gnulib-l10n-0:20241231-1.fc44 100% | 2.3 MiB/s | 150.2 KiB | 00m00s [ 98/183] ncurses-base-0:6.5-8.20250614 100% | 1.3 MiB/s | 88.1 KiB | 00m00s [ 99/183] libsepol-0:3.9-2.fc43.x86_64 100% | 5.0 MiB/s | 345.4 KiB | 00m00s [100/183] crypto-policies-0:20251128-1. 100% | 1.4 MiB/s | 98.1 KiB | 00m00s [101/183] ca-certificates-0:2025.2.80_v 100% | 12.9 MiB/s | 973.8 KiB | 00m00s [102/183] fedora-gpg-keys-0:44-0.1.noar 100% | 2.1 MiB/s | 138.8 KiB | 00m00s [103/183] fedora-repos-rawhide-0:44-0.1 100% | 133.0 KiB/s | 8.6 KiB | 00m00s [104/183] pcre2-syntax-0:10.47-1.fc44.n 100% | 2.5 MiB/s | 164.7 KiB | 00m00s [105/183] add-determinism-0:0.7.2-2.fc4 100% | 11.9 MiB/s | 887.6 KiB | 00m00s [106/183] linkdupes-0:0.7.2-2.fc44.x86_ 100% | 4.6 MiB/s | 356.3 KiB | 00m00s [107/183] file-libs-0:5.46-8.fc44.x86_6 100% | 11.5 MiB/s | 849.9 KiB | 00m00s [108/183] curl-0:8.18.0~rc2-1.fc44.x86_ 100% | 3.5 MiB/s | 237.5 KiB | 00m00s [109/183] elfutils-debuginfod-client-0: 100% | 723.4 KiB/s | 46.3 KiB | 00m00s [110/183] elfutils-libs-0:0.194-2.fc44. 100% | 3.7 MiB/s | 271.0 KiB | 00m00s [111/183] libzstd-0:1.5.7-3.fc44.x86_64 100% | 5.2 MiB/s | 359.1 KiB | 00m00s [112/183] libxml2-0:2.12.10-5.fc44.x86_ 100% | 9.7 MiB/s | 692.7 KiB | 00m00s [113/183] lz4-libs-0:1.10.0-3.fc43.x86_ 100% | 1.2 MiB/s | 78.0 KiB | 00m00s [114/183] lua-libs-0:5.4.8-4.fc44.x86_6 100% | 2.0 MiB/s | 133.1 KiB | 00m00s [115/183] rpm-sign-libs-0:6.0.1-1.fc44. 100% | 443.8 KiB/s | 28.0 KiB | 00m00s [116/183] sqlite-libs-0:3.51.0-1.fc44.x 100% | 10.4 MiB/s | 766.5 KiB | 00m00s [117/183] rpm-sequoia-0:1.10.0-1.fc44.x 100% | 9.8 MiB/s | 939.4 KiB | 00m00s [118/183] elfutils-default-yama-scope-0 100% | 158.9 KiB/s | 11.8 KiB | 00m00s [119/183] json-c-0:0.18-7.fc43.x86_64 100% | 691.9 KiB/s | 45.0 KiB | 00m00s [120/183] ima-evm-utils-libs-0:1.6.2-7. 100% | 467.1 KiB/s | 29.4 KiB | 00m00s [121/183] libfsverity-0:1.6-3.fc43.x86_ 100% | 286.6 KiB/s | 18.6 KiB | 00m00s [122/183] gnupg2-0:2.4.8-4.fc43.x86_64 100% | 12.0 MiB/s | 1.6 MiB | 00m00s [123/183] gpgverify-0:2.2-3.fc43.noarch 100% | 176.2 KiB/s | 11.1 KiB | 00m00s [124/183] gnupg2-dirmngr-0:2.4.8-4.fc43 100% | 4.0 MiB/s | 274.6 KiB | 00m00s [125/183] gnupg2-gpg-agent-0:2.4.8-4.fc 100% | 3.9 MiB/s | 272.9 KiB | 00m00s [126/183] gnupg2-gpgconf-0:2.4.8-4.fc43 100% | 1.6 MiB/s | 115.0 KiB | 00m00s [127/183] gnupg2-keyboxd-0:2.4.8-4.fc43 100% | 1.4 MiB/s | 94.7 KiB | 00m00s [128/183] gnupg2-verify-0:2.4.8-4.fc43. 100% | 2.5 MiB/s | 171.2 KiB | 00m00s [129/183] libassuan-0:2.5.7-4.fc43.x86_ 100% | 1.0 MiB/s | 67.4 KiB | 00m00s [130/183] libgcrypt-0:1.11.2-1.fc44.x86 100% | 8.3 MiB/s | 596.1 KiB | 00m00s [131/183] libgpg-error-0:1.58-1.fc44.x8 100% | 3.6 MiB/s | 250.3 KiB | 00m00s [132/183] npth-0:1.8-3.fc43.x86_64 100% | 407.2 KiB/s | 25.7 KiB | 00m00s [133/183] tpm2-tss-0:4.1.3-8.fc43.x86_6 100% | 6.1 MiB/s | 425.9 KiB | 00m00s [134/183] gnutls-0:3.8.11-6.fc44.x86_64 100% | 15.7 MiB/s | 1.4 MiB | 00m00s [135/183] libksba-0:1.6.7-4.fc43.x86_64 100% | 2.4 MiB/s | 160.4 KiB | 00m00s [136/183] openldap-0:2.6.10-4.fc44.x86_ 100% | 3.8 MiB/s | 259.5 KiB | 00m00s [137/183] libusb1-0:1.0.29-4.fc44.x86_6 100% | 1.2 MiB/s | 79.9 KiB | 00m00s [138/183] libidn2-0:2.3.8-2.fc43.x86_64 100% | 2.7 MiB/s | 174.9 KiB | 00m00s [139/183] libtasn1-0:4.20.0-2.fc43.x86_ 100% | 1.1 MiB/s | 74.5 KiB | 00m00s [140/183] nettle-0:3.10.1-2.fc43.x86_64 100% | 6.2 MiB/s | 424.2 KiB | 00m00s [141/183] libunistring-0:1.1-10.fc43.x8 100% | 7.4 MiB/s | 542.9 KiB | 00m00s [142/183] p11-kit-0:0.25.8-1.fc44.x86_6 100% | 7.1 MiB/s | 510.0 KiB | 00m00s [143/183] cyrus-sasl-lib-0:2.1.28-33.fc 100% | 11.0 MiB/s | 796.5 KiB | 00m00s [144/183] libevent-0:2.1.12-16.fc43.x86 100% | 3.7 MiB/s | 257.8 KiB | 00m00s [145/183] libtool-ltdl-0:2.5.4-8.fc44.x 100% | 557.5 KiB/s | 36.2 KiB | 00m00s [146/183] libgcc-0:16.0.0-0.2.fc44.x86_ 100% | 8.3 MiB/s | 110.8 KiB | 00m00s [147/183] libffi-0:3.5.2-1.fc44.x86_64 100% | 651.7 KiB/s | 41.1 KiB | 00m00s [148/183] gdbm-libs-1:1.23-10.fc43.x86_ 100% | 873.3 KiB/s | 56.8 KiB | 00m00s [149/183] p11-kit-trust-0:0.25.8-1.fc44 100% | 2.1 MiB/s | 139.7 KiB | 00m00s [150/183] pkgconf-pkg-config-0:2.3.0-3. 100% | 152.5 KiB/s | 9.6 KiB | 00m00s [151/183] alternatives-0:1.33-3.fc44.x8 100% | 627.1 KiB/s | 40.8 KiB | 00m00s [152/183] pkgconf-0:2.3.0-3.fc43.x86_64 100% | 685.7 KiB/s | 44.6 KiB | 00m00s [153/183] libstdc++-0:16.0.0-0.2.fc44.x 100% | 75.9 MiB/s | 932.4 KiB | 00m00s [154/183] pkgconf-m4-0:2.3.0-3.fc43.noa 100% | 220.8 KiB/s | 13.9 KiB | 00m00s [155/183] libgomp-0:16.0.0-0.2.fc44.x86 100% | 72.0 MiB/s | 368.9 KiB | 00m00s [156/183] libpkgconf-0:2.3.0-3.fc43.x86 100% | 512.1 KiB/s | 37.9 KiB | 00m00s [157/183] binutils-0:2.45.50-12.fc44.x8 100% | 267.0 MiB/s | 5.9 MiB | 00m00s [158/183] libatomic-0:16.0.0-0.2.fc44.x 100% | 3.3 MiB/s | 20.4 KiB | 00m00s [159/183] jansson-0:2.14-3.fc43.x86_64 100% | 696.6 KiB/s | 45.3 KiB | 00m00s [160/183] fedora-release-0:44-0.10.noar 100% | 175.4 KiB/s | 13.5 KiB | 00m00s [161/183] systemd-standalone-sysusers-0 100% | 1.8 MiB/s | 143.8 KiB | 00m00s [162/183] fedora-release-identity-basic 100% | 180.7 KiB/s | 14.3 KiB | 00m00s [163/183] xxhash-libs-0:0.8.3-3.fc43.x8 100% | 487.2 KiB/s | 38.5 KiB | 00m00s [164/183] gdb-minimal-0:16.3-6.fc44.x86 100% | 36.1 MiB/s | 4.4 MiB | 00m00s [165/183] libcurl-0:8.18.0~rc2-1.fc44.x 100% | 5.4 MiB/s | 435.1 KiB | 00m00s [166/183] libbrotli-0:1.2.0-1.fc44.x86_ 100% | 5.0 MiB/s | 349.2 KiB | 00m00s [167/183] krb5-libs-0:1.21.3-10.fc44.x8 100% | 8.4 MiB/s | 761.1 KiB | 00m00s [168/183] libnghttp3-0:1.13.1-1.fc44.x8 100% | 1.1 MiB/s | 70.2 KiB | 00m00s [169/183] libnghttp2-0:1.68.0-2.fc44.x8 100% | 1.1 MiB/s | 72.9 KiB | 00m00s [170/183] libpsl-0:0.21.5-6.fc43.x86_64 100% | 1.0 MiB/s | 65.0 KiB | 00m00s [171/183] libssh-0:0.11.3-1.fc44.x86_64 100% | 3.5 MiB/s | 232.8 KiB | 00m00s [172/183] ngtcp2-0:1.18.0-1.fc44.x86_64 100% | 2.2 MiB/s | 147.1 KiB | 00m00s [173/183] ngtcp2-crypto-ossl-0:1.18.0-1 100% | 410.0 KiB/s | 26.7 KiB | 00m00s [174/183] keyutils-libs-0:1.6.3-6.fc43. 100% | 497.6 KiB/s | 31.4 KiB | 00m00s [175/183] libcom_err-0:1.47.3-3.fc44.x8 100% | 414.3 KiB/s | 26.9 KiB | 00m00s [176/183] libverto-0:0.3.2-11.fc43.x86_ 100% | 318.1 KiB/s | 20.7 KiB | 00m00s [177/183] publicsuffix-list-dafsa-0:202 100% | 938.9 KiB/s | 59.2 KiB | 00m00s [178/183] libssh-config-0:0.11.3-1.fc44 100% | 140.2 KiB/s | 9.1 KiB | 00m00s [179/183] policycoreutils-0:3.9-5.fc44. 100% | 3.2 MiB/s | 214.6 KiB | 00m00s [180/183] selinux-policy-0:42.19-1.fc44 100% | 1.0 MiB/s | 65.4 KiB | 00m00s [181/183] libselinux-utils-0:3.9-5.fc44 100% | 1.8 MiB/s | 119.3 KiB | 00m00s [182/183] rpm-plugin-selinux-0:6.0.1-1. 100% | 296.0 KiB/s | 19.2 KiB | 00m00s [183/183] selinux-policy-targeted-0:42. 100% | 32.2 MiB/s | 6.8 MiB | 00m00s -------------------------------------------------------------------------------- [183/183] Total 100% | 13.3 MiB/s | 66.9 MiB | 00m05s Running transaction Importing OpenPGP key 0x6D9F90A6: UserID : "Fedora (44) " Fingerprint: 36F612DCF27F7D1A48A835E4DBFCF71C6D9F90A6 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-44-primary The key was successfully imported. Importing OpenPGP key 0x6D9F90A6: UserID : "Fedora (44) " Fingerprint: 36F612DCF27F7D1A48A835E4DBFCF71C6D9F90A6 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-44-primary The key was successfully imported. Importing OpenPGP key 0x31645531: UserID : "Fedora (43) " Fingerprint: C6E7F081CF80E13146676E88829B606631645531 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-43-primary The key was successfully imported. Importing OpenPGP key 0xF577861E: UserID : "Fedora (45) " Fingerprint: 4F50A6114CD5C6976A7F1179655A4B02F577861E From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-45-primary The key was successfully imported. [ 1/185] Verify package files 100% | 690.0 B/s | 183.0 B | 00m00s >>> Running %pretrans scriptlet: filesystem-0:3.18-50.fc43.x86_64 >>> Finished %pretrans scriptlet: filesystem-0:3.18-50.fc43.x86_64 >>> [RPM] /var/lib/mock/fedora-rawhide-x86_64-1766268696.323769/root/var/cache/dnf/copr_base-6729cd57cc4ff6a8/packages/libgcc-16.0.0-0.2.fc44.x86_64.rpm: Header OpenPGP V4 RSA/SHA256 signature, key ID 4d0ea48a8d983303: NOKEY [ 2/185] Prepare transaction 100% | 3.4 KiB/s | 183.0 B | 00m00s [ 3/185] Installing libgcc-0:16.0.0-0. 100% | 266.0 MiB/s | 272.4 KiB | 00m00s [ 4/185] Installing libssh-config-0:0. 100% | 0.0 B/s | 816.0 B | 00m00s [ 5/185] Installing publicsuffix-list- 100% | 0.0 B/s | 69.8 KiB | 00m00s [ 6/185] Installing fedora-release-ide 100% | 0.0 B/s | 920.0 B | 00m00s [ 7/185] Installing fedora-gpg-keys-0: 100% | 43.7 MiB/s | 179.0 KiB | 00m00s [ 8/185] Installing fedora-repos-rawhi 100% | 0.0 B/s | 2.4 KiB | 00m00s [ 9/185] Installing fedora-repos-0:44- 100% | 0.0 B/s | 5.7 KiB | 00m00s [ 10/185] Installing fedora-release-com 100% | 24.3 MiB/s | 24.9 KiB | 00m00s [ 11/185] Installing fedora-release-0:4 100% | 20.2 KiB/s | 124.0 B | 00m00s >>> Running sysusers scriptlet: setup-0:2.15.0-27.fc44.noarch >>> Finished sysusers scriptlet: setup-0:2.15.0-27.fc44.noarch >>> Scriptlet output: >>> Creating group 'adm' with GID 4. >>> Creating group 'audio' with GID 63. >>> Creating group 'cdrom' with GID 11. >>> Creating group 'clock' with GID 103. >>> Creating group 'dialout' with GID 18. >>> Creating group 'disk' with GID 6. >>> Creating group 'floppy' with GID 19. >>> Creating group 'ftp' with GID 50. >>> Creating group 'games' with GID 20. >>> Creating group 'input' with GID 104. >>> Creating group 'kmem' with GID 9. >>> Creating group 'kvm' with GID 36. >>> Creating group 'lock' with GID 54. >>> Creating group 'lp' with GID 7. >>> Creating group 'mail' with GID 12. >>> Creating group 'man' with GID 15. >>> Creating group 'mem' with GID 8. >>> Creating group 'nobody' with GID 65534. >>> Creating group 'render' with GID 105. >>> Creating group 'root' with GID 0. >>> Creating group 'sgx' with GID 106. >>> Creating group 'sys' with GID 3. >>> Creating group 'tape' with GID 33. >>> Creating group 'tty' with GID 5. >>> Creating group 'users' with GID 100. >>> Creating group 'utmp' with GID 22. >>> Creating group 'video' with GID 39. >>> Creating group 'wheel' with GID 10. >>> Creating user 'adm' (adm) with UID 3 and GID 4. >>> Creating group 'bin' with GID 1. >>> Creating user 'bin' (bin) with UID 1 and GID 1. >>> Creating group 'daemon' with GID 2. >>> Creating user 'daemon' (daemon) with UID 2 and GID 2. >>> Creating user 'ftp' (FTP User) with UID 14 and GID 50. >>> Creating user 'games' (games) with UID 12 and GID 100. >>> Creating user 'halt' (halt) with UID 7 and GID 0. >>> Creating user 'lp' (lp) with UID 4 and GID 7. >>> Creating user 'mail' (mail) with UID 8 and GID 12. >>> Creating user 'nobody' (Kernel Overflow User) with UID 65534 and GID 65534. >>> Creating user 'operator' (operator) with UID 11 and GID 0. >>> Creating user 'root' (Super User) with UID 0 and GID 0. >>> Creating user 'shutdown' (shutdown) with UID 6 and GID 0. >>> Creating user 'sync' (sync) with UID 5 and GID 0. >>> [ 12/185] Installing setup-0:2.15.0-27. 100% | 51.0 MiB/s | 730.6 KiB | 00m00s >>> [RPM] /etc/hosts created as /etc/hosts.rpmnew [ 13/185] Installing filesystem-0:3.18- 100% | 2.8 MiB/s | 212.8 KiB | 00m00s [ 14/185] Installing pkgconf-m4-0:2.3.0 100% | 0.0 B/s | 14.8 KiB | 00m00s [ 15/185] Installing pcre2-syntax-0:10. 100% | 277.7 MiB/s | 284.3 KiB | 00m00s [ 16/185] Installing gnulib-l10n-0:2024 100% | 215.5 MiB/s | 661.9 KiB | 00m00s [ 17/185] Installing coreutils-common-0 100% | 387.2 MiB/s | 11.2 MiB | 00m00s [ 18/185] Installing ncurses-base-0:6.5 100% | 86.3 MiB/s | 353.5 KiB | 00m00s [ 19/185] Installing bash-0:5.3.0-2.fc4 100% | 271.9 MiB/s | 8.4 MiB | 00m00s [ 20/185] Installing glibc-common-0:2.4 100% | 64.0 MiB/s | 1.0 MiB | 00m00s [ 21/185] Installing glibc-gconv-extra- 100% | 280.2 MiB/s | 7.3 MiB | 00m00s [ 22/185] Installing glibc-0:2.42.9000- 100% | 185.3 MiB/s | 6.9 MiB | 00m00s [ 23/185] Installing ncurses-libs-0:6.5 100% | 232.7 MiB/s | 953.0 KiB | 00m00s [ 24/185] Installing glibc-minimal-lang 100% | 0.0 B/s | 124.0 B | 00m00s [ 25/185] Installing zlib-ng-compat-0:2 100% | 158.6 MiB/s | 162.4 KiB | 00m00s [ 26/185] Installing bzip2-libs-0:1.0.8 100% | 79.8 MiB/s | 81.7 KiB | 00m00s [ 27/185] Installing libgpg-error-0:1.5 100% | 61.7 MiB/s | 947.5 KiB | 00m00s [ 28/185] Installing libassuan-0:2.5.7- 100% | 161.7 MiB/s | 165.6 KiB | 00m00s [ 29/185] Installing libgcrypt-0:1.11.2 100% | 394.0 MiB/s | 1.6 MiB | 00m00s [ 30/185] Installing readline-0:8.3-2.f 100% | 250.9 MiB/s | 513.9 KiB | 00m00s [ 31/185] Installing gmp-1:6.3.0-4.fc44 100% | 399.2 MiB/s | 817.5 KiB | 00m00s [ 32/185] Installing xz-libs-1:5.8.1-4. 100% | 213.8 MiB/s | 218.9 KiB | 00m00s [ 33/185] Installing libuuid-0:2.41.3-8 100% | 0.0 B/s | 38.2 KiB | 00m00s [ 34/185] Installing popt-0:1.19-9.fc43 100% | 68.1 MiB/s | 139.4 KiB | 00m00s [ 35/185] Installing libzstd-0:1.5.7-3. 100% | 306.5 MiB/s | 941.6 KiB | 00m00s [ 36/185] Installing elfutils-libelf-0: 100% | 373.7 MiB/s | 1.1 MiB | 00m00s [ 37/185] Installing npth-0:1.8-3.fc43. 100% | 0.0 B/s | 50.7 KiB | 00m00s [ 38/185] Installing libblkid-0:2.41.3- 100% | 257.2 MiB/s | 263.4 KiB | 00m00s [ 39/185] Installing systemd-libs-0:259 100% | 334.3 MiB/s | 2.3 MiB | 00m00s [ 40/185] Installing libxcrypt-0:4.5.2- 100% | 281.3 MiB/s | 288.0 KiB | 00m00s [ 41/185] Installing libsepol-0:3.9-2.f 100% | 267.9 MiB/s | 822.9 KiB | 00m00s [ 42/185] Installing sqlite-libs-0:3.51 100% | 383.0 MiB/s | 1.5 MiB | 00m00s [ 43/185] Installing gnupg2-gpgconf-0:2 100% | 18.9 MiB/s | 252.0 KiB | 00m00s [ 44/185] Installing libattr-0:2.5.2-6. 100% | 0.0 B/s | 25.4 KiB | 00m00s [ 45/185] Installing libacl-0:2.3.2-4.f 100% | 0.0 B/s | 36.8 KiB | 00m00s [ 46/185] Installing pcre2-0:10.47-1.fc 100% | 343.8 MiB/s | 704.1 KiB | 00m00s [ 47/185] Installing libselinux-0:3.9-5 100% | 189.8 MiB/s | 194.4 KiB | 00m00s [ 48/185] Installing grep-0:3.12-2.fc43 100% | 62.7 MiB/s | 1.0 MiB | 00m00s [ 49/185] Installing sed-0:4.9-6.fc44.x 100% | 56.3 MiB/s | 865.5 KiB | 00m00s [ 50/185] Installing findutils-1:4.10.0 100% | 109.3 MiB/s | 1.9 MiB | 00m00s [ 51/185] Installing libtasn1-0:4.20.0- 100% | 173.9 MiB/s | 178.1 KiB | 00m00s [ 52/185] Installing libunistring-0:1.1 100% | 345.3 MiB/s | 1.7 MiB | 00m00s [ 53/185] Installing libidn2-0:2.3.8-2. 100% | 60.6 MiB/s | 558.7 KiB | 00m00s [ 54/185] Installing crypto-policies-0: 100% | 30.8 MiB/s | 157.7 KiB | 00m00s [ 55/185] Installing xz-1:5.8.1-4.fc44. 100% | 74.0 MiB/s | 1.3 MiB | 00m00s [ 56/185] Installing libmount-0:2.41.3- 100% | 364.8 MiB/s | 373.6 KiB | 00m00s [ 57/185] Installing gnupg2-verify-0:2. 100% | 26.3 MiB/s | 349.9 KiB | 00m00s [ 58/185] Installing dwz-0:0.16-2.fc43. 100% | 23.5 MiB/s | 288.5 KiB | 00m00s [ 59/185] Installing mpfr-0:4.2.2-2.fc4 100% | 271.6 MiB/s | 834.4 KiB | 00m00s [ 60/185] Installing gawk-0:5.3.2-2.fc4 100% | 106.8 MiB/s | 1.8 MiB | 00m00s [ 61/185] Installing libksba-0:1.6.7-4. 100% | 391.7 MiB/s | 401.1 KiB | 00m00s [ 62/185] Installing unzip-0:6.0-68.fc4 100% | 29.6 MiB/s | 393.8 KiB | 00m00s [ 63/185] Installing file-libs-0:5.46-8 100% | 658.7 MiB/s | 11.9 MiB | 00m00s [ 64/185] Installing file-0:5.46-8.fc44 100% | 8.3 MiB/s | 101.7 KiB | 00m00s [ 65/185] Installing diffutils-0:3.12-3 100% | 97.6 MiB/s | 1.6 MiB | 00m00s [ 66/185] Installing libeconf-0:0.7.9-2 100% | 65.0 MiB/s | 66.5 KiB | 00m00s [ 67/185] Installing libcap-ng-0:0.8.5- 100% | 69.2 MiB/s | 70.8 KiB | 00m00s [ 68/185] Installing audit-libs-0:4.1.2 100% | 186.3 MiB/s | 381.5 KiB | 00m00s [ 69/185] Installing pam-libs-0:1.7.1-3 100% | 126.0 MiB/s | 129.0 KiB | 00m00s [ 70/185] Installing libcap-0:2.77-1.fc 100% | 16.1 MiB/s | 214.3 KiB | 00m00s [ 71/185] Installing libsemanage-0:3.9- 100% | 303.0 MiB/s | 310.2 KiB | 00m00s [ 72/185] Installing libsmartcols-0:2.4 100% | 177.1 MiB/s | 181.4 KiB | 00m00s [ 73/185] Installing lua-libs-0:5.4.8-4 100% | 276.7 MiB/s | 283.3 KiB | 00m00s [ 74/185] Installing json-c-0:0.18-7.fc 100% | 82.0 MiB/s | 84.0 KiB | 00m00s [ 75/185] Installing libffi-0:3.5.2-1.f 100% | 83.2 MiB/s | 85.2 KiB | 00m00s [ 76/185] Installing p11-kit-0:0.25.8-1 100% | 114.5 MiB/s | 2.3 MiB | 00m00s [ 77/185] Installing alternatives-0:1.3 100% | 5.2 MiB/s | 63.8 KiB | 00m00s [ 78/185] Installing p11-kit-trust-0:0. 100% | 21.9 MiB/s | 448.3 KiB | 00m00s [ 79/185] Installing ngtcp2-0:1.18.0-1. 100% | 154.2 MiB/s | 315.8 KiB | 00m00s [ 80/185] Installing openssl-libs-1:3.5 100% | 371.3 MiB/s | 8.9 MiB | 00m00s [ 81/185] Installing coreutils-0:9.9-1. 100% | 166.3 MiB/s | 5.5 MiB | 00m00s [ 82/185] Installing ca-certificates-0: 100% | 2.0 MiB/s | 2.5 MiB | 00m01s [ 83/185] Installing gzip-0:1.14-1.fc44 100% | 26.3 MiB/s | 403.3 KiB | 00m00s [ 84/185] Installing rpm-sequoia-0:1.10 100% | 352.3 MiB/s | 2.5 MiB | 00m00s [ 85/185] Installing libfsverity-0:1.6- 100% | 28.8 MiB/s | 29.5 KiB | 00m00s [ 86/185] Installing libevent-0:2.1.12- 100% | 288.7 MiB/s | 886.8 KiB | 00m00s [ 87/185] Installing ngtcp2-crypto-ossl 100% | 51.3 MiB/s | 52.6 KiB | 00m00s [ 88/185] Installing util-linux-core-0: 100% | 82.0 MiB/s | 1.5 MiB | 00m00s [ 89/185] Installing zip-0:3.0-44.fc43. 100% | 48.7 MiB/s | 698.4 KiB | 00m00s [ 90/185] Installing gnupg2-keyboxd-0:2 100% | 33.0 MiB/s | 202.7 KiB | 00m00s [ 91/185] Installing libpsl-0:0.21.5-6. 100% | 75.7 MiB/s | 77.5 KiB | 00m00s [ 92/185] Installing tar-2:1.35-6.fc43. 100% | 134.5 MiB/s | 3.0 MiB | 00m00s [ 93/185] Installing linkdupes-0:0.7.2- 100% | 54.7 MiB/s | 840.1 KiB | 00m00s [ 94/185] Installing libselinux-utils-0 100% | 22.6 MiB/s | 323.4 KiB | 00m00s [ 95/185] Installing liblastlog2-0:2.41 100% | 5.8 MiB/s | 35.8 KiB | 00m00s [ 96/185] Installing systemd-standalone 100% | 20.5 MiB/s | 294.2 KiB | 00m00s [ 97/185] Installing libusb1-0:1.0.29-4 100% | 21.1 MiB/s | 172.9 KiB | 00m00s >>> Running sysusers scriptlet: tpm2-tss-0:4.1.3-8.fc43.x86_64 >>> Finished sysusers scriptlet: tpm2-tss-0:4.1.3-8.fc43.x86_64 >>> Scriptlet output: >>> Creating group 'tss' with GID 59. >>> Creating user 'tss' (Account used for TPM access) with UID 59 and GID 59. >>> [ 98/185] Installing tpm2-tss-0:4.1.3-8 100% | 262.0 MiB/s | 1.6 MiB | 00m00s [ 99/185] Installing ima-evm-utils-libs 100% | 60.5 MiB/s | 62.0 KiB | 00m00s [100/185] Installing gnupg2-gpg-agent-0 100% | 33.0 MiB/s | 675.4 KiB | 00m00s [101/185] Installing libfdisk-0:2.41.3- 100% | 124.1 MiB/s | 381.3 KiB | 00m00s [102/185] Installing util-linux-0:2.41. 100% | 101.8 MiB/s | 3.6 MiB | 00m00s [103/185] Installing policycoreutils-0: 100% | 27.8 MiB/s | 711.8 KiB | 00m00s [104/185] Installing selinux-policy-0:4 100% | 1.6 MiB/s | 33.6 KiB | 00m00s [105/185] Installing selinux-policy-tar 100% | 188.8 MiB/s | 14.9 MiB | 00m00s [106/185] Installing libxml2-0:2.12.10- 100% | 85.2 MiB/s | 1.7 MiB | 00m00s [107/185] Installing nettle-0:3.10.1-2. 100% | 258.4 MiB/s | 793.7 KiB | 00m00s [108/185] Installing gnutls-0:3.8.11-6. 100% | 364.9 MiB/s | 3.6 MiB | 00m00s [109/185] Installing bzip2-0:1.0.8-21.f 100% | 7.5 MiB/s | 99.8 KiB | 00m00s [110/185] Installing add-determinism-0: 100% | 128.0 MiB/s | 2.3 MiB | 00m00s [111/185] Installing build-reproducibil 100% | 0.0 B/s | 1.5 KiB | 00m00s [112/185] Installing cpio-0:2.15-6.fc43 100% | 68.7 MiB/s | 1.1 MiB | 00m00s [113/185] Installing ed-0:1.22.3-1.fc44 100% | 11.4 MiB/s | 151.2 KiB | 00m00s [114/185] Installing patch-0:2.8-3.fc44 100% | 17.1 MiB/s | 228.2 KiB | 00m00s [115/185] Installing lz4-libs-0:1.10.0- 100% | 158.6 MiB/s | 162.5 KiB | 00m00s [116/185] Installing libarchive-0:3.8.4 100% | 311.6 MiB/s | 957.2 KiB | 00m00s [117/185] Installing libtool-ltdl-0:2.5 100% | 69.6 MiB/s | 71.2 KiB | 00m00s [118/185] Installing gdbm-libs-1:1.23-1 100% | 128.5 MiB/s | 131.6 KiB | 00m00s [119/185] Installing cyrus-sasl-lib-0:2 100% | 127.8 MiB/s | 2.3 MiB | 00m00s [120/185] Installing openldap-0:2.6.10- 100% | 216.0 MiB/s | 663.6 KiB | 00m00s [121/185] Installing gnupg2-dirmngr-0:2 100% | 30.3 MiB/s | 621.1 KiB | 00m00s [122/185] Installing gnupg2-0:2.4.8-4.f 100% | 211.3 MiB/s | 6.6 MiB | 00m00s [123/185] Installing gpgverify-0:2.2-3. 100% | 0.0 B/s | 9.4 KiB | 00m00s [124/185] Installing libpkgconf-0:2.3.0 100% | 77.4 MiB/s | 79.2 KiB | 00m00s [125/185] Installing pkgconf-0:2.3.0-3. 100% | 6.8 MiB/s | 91.0 KiB | 00m00s [126/185] Installing pkgconf-pkg-config 100% | 147.8 KiB/s | 1.8 KiB | 00m00s [127/185] Installing libgomp-0:16.0.0-0 100% | 279.4 MiB/s | 572.3 KiB | 00m00s [128/185] Installing jansson-0:2.14-3.f 100% | 88.3 MiB/s | 90.5 KiB | 00m00s [129/185] Installing libatomic-0:16.0.0 100% | 0.0 B/s | 37.5 KiB | 00m00s [130/185] Installing libstdc++-0:16.0.0 100% | 364.8 MiB/s | 2.9 MiB | 00m00s [131/185] Installing rpm-libs-0:6.0.1-1 100% | 304.5 MiB/s | 935.3 KiB | 00m00s [132/185] Installing rpm-sign-libs-0:6. 100% | 39.6 MiB/s | 40.6 KiB | 00m00s [133/185] Installing zstd-0:1.5.7-3.fc4 100% | 35.6 MiB/s | 509.8 KiB | 00m00s [134/185] Installing xxhash-libs-0:0.8. 100% | 89.4 MiB/s | 91.6 KiB | 00m00s [135/185] Installing libbrotli-0:1.2.0- 100% | 282.4 MiB/s | 867.7 KiB | 00m00s [136/185] Installing libnghttp2-0:1.68. 100% | 159.5 MiB/s | 163.4 KiB | 00m00s [137/185] Installing libnghttp3-0:1.13. 100% | 153.0 MiB/s | 156.7 KiB | 00m00s [138/185] Installing keyutils-libs-0:1. 100% | 54.4 MiB/s | 55.7 KiB | 00m00s [139/185] Installing libcom_err-0:1.47. 100% | 62.7 MiB/s | 64.2 KiB | 00m00s [140/185] Installing libverto-0:0.3.2-1 100% | 26.6 MiB/s | 27.2 KiB | 00m00s [141/185] Installing krb5-libs-0:1.21.3 100% | 328.5 MiB/s | 2.3 MiB | 00m00s [142/185] Installing libssh-0:0.11.3-1. 100% | 277.9 MiB/s | 569.2 KiB | 00m00s [143/185] Installing libcurl-0:8.18.0~r 100% | 320.9 MiB/s | 985.8 KiB | 00m00s [144/185] Installing curl-0:8.18.0~rc2- 100% | 21.0 MiB/s | 474.1 KiB | 00m00s [145/185] Installing rpm-0:6.0.1-1.fc44 100% | 79.7 MiB/s | 2.6 MiB | 00m00s [146/185] Installing cmake-srpm-macros- 100% | 0.0 B/s | 804.0 B | 00m00s [147/185] Installing efi-srpm-macros-0: 100% | 0.0 B/s | 41.2 KiB | 00m00s [148/185] Installing java-srpm-macros-0 100% | 0.0 B/s | 1.1 KiB | 00m00s [149/185] Installing lua-srpm-macros-0: 100% | 0.0 B/s | 1.9 KiB | 00m00s [150/185] Installing tree-sitter-srpm-m 100% | 0.0 B/s | 9.3 KiB | 00m00s [151/185] Installing zig-srpm-macros-0: 100% | 0.0 B/s | 1.7 KiB | 00m00s [152/185] Installing filesystem-srpm-ma 100% | 0.0 B/s | 38.9 KiB | 00m00s [153/185] Installing elfutils-default-y 100% | 408.6 KiB/s | 2.0 KiB | 00m00s [154/185] Installing elfutils-libs-0:0. 100% | 224.4 MiB/s | 689.3 KiB | 00m00s [155/185] Installing elfutils-debuginfo 100% | 6.0 MiB/s | 86.3 KiB | 00m00s [156/185] Installing elfutils-0:0.194-2 100% | 146.5 MiB/s | 2.9 MiB | 00m00s [157/185] Installing binutils-0:2.45.50 100% | 322.3 MiB/s | 27.1 MiB | 00m00s [158/185] Installing gdb-minimal-0:16.3 100% | 276.2 MiB/s | 13.3 MiB | 00m00s [159/185] Installing debugedit-0:5.2-3. 100% | 16.3 MiB/s | 217.3 KiB | 00m00s [160/185] Installing rpm-build-libs-0:6 100% | 259.0 MiB/s | 265.2 KiB | 00m00s [161/185] Installing rust-srpm-macros-0 100% | 0.0 B/s | 6.4 KiB | 00m00s [162/185] Installing qt6-srpm-macros-0: 100% | 0.0 B/s | 740.0 B | 00m00s [163/185] Installing qt5-srpm-macros-0: 100% | 0.0 B/s | 776.0 B | 00m00s [164/185] Installing perl-srpm-macros-0 100% | 0.0 B/s | 1.1 KiB | 00m00s [165/185] Installing package-notes-srpm 100% | 0.0 B/s | 2.0 KiB | 00m00s [166/185] Installing openblas-srpm-macr 100% | 0.0 B/s | 392.0 B | 00m00s [167/185] Installing ocaml-srpm-macros- 100% | 0.0 B/s | 2.1 KiB | 00m00s [168/185] Installing kernel-srpm-macros 100% | 0.0 B/s | 2.3 KiB | 00m00s [169/185] Installing gnat-srpm-macros-0 100% | 0.0 B/s | 1.3 KiB | 00m00s [170/185] Installing ghc-srpm-macros-0: 100% | 0.0 B/s | 1.0 KiB | 00m00s [171/185] Installing gap-srpm-macros-0: 100% | 0.0 B/s | 2.7 KiB | 00m00s [172/185] Installing fpc-srpm-macros-0: 100% | 0.0 B/s | 420.0 B | 00m00s [173/185] Installing ansible-srpm-macro 100% | 0.0 B/s | 36.2 KiB | 00m00s [174/185] Installing redhat-rpm-config- 100% | 92.7 MiB/s | 189.9 KiB | 00m00s [175/185] Installing forge-srpm-macros- 100% | 0.0 B/s | 40.3 KiB | 00m00s [176/185] Installing fonts-srpm-macros- 100% | 55.7 MiB/s | 57.0 KiB | 00m00s [177/185] Installing go-srpm-macros-0:3 100% | 12.3 MiB/s | 63.0 KiB | 00m00s [178/185] Installing rpm-build-0:6.0.1- 100% | 19.3 MiB/s | 296.6 KiB | 00m00s [179/185] Installing pyproject-srpm-mac 100% | 0.0 B/s | 2.5 KiB | 00m00s [180/185] Installing R-srpm-macros-0:1. 100% | 0.0 B/s | 4.0 KiB | 00m00s [181/185] Installing python-srpm-macros 100% | 0.0 B/s | 52.9 KiB | 00m00s [182/185] Installing rpm-plugin-selinux 100% | 0.0 B/s | 13.0 KiB | 00m00s [183/185] Installing which-0:2.23-3.fc4 100% | 6.0 MiB/s | 85.7 KiB | 00m00s [184/185] Installing shadow-utils-2:4.1 100% | 132.4 MiB/s | 4.0 MiB | 00m00s [185/185] Installing info-0:7.2-7.fc44. 100% | 46.0 KiB/s | 358.3 KiB | 00m08s Warning: skipped OpenPGP checks for 5 packages from repository: copr_base Complete! Finish: installing minimal buildroot with dnf5 Start: creating root cache Finish: creating root cache Finish: chroot init INFO: Installed packages: INFO: R-srpm-macros-1.3.0-1.fc44.noarch add-determinism-0.7.2-2.fc44.x86_64 alternatives-1.33-3.fc44.x86_64 ansible-srpm-macros-1-18.1.fc43.noarch audit-libs-4.1.2-2.fc44.x86_64 bash-5.3.0-2.fc43.x86_64 binutils-2.45.50-12.fc44.x86_64 build-reproducibility-srpm-macros-0.7.2-2.fc44.noarch bzip2-1.0.8-21.fc43.x86_64 bzip2-libs-1.0.8-21.fc43.x86_64 ca-certificates-2025.2.80_v9.0.304-2.fc44.noarch cmake-srpm-macros-3.31.10-3.fc44.noarch coreutils-9.9-1.fc44.x86_64 coreutils-common-9.9-1.fc44.x86_64 cpio-2.15-6.fc43.x86_64 crypto-policies-20251128-1.git19878fe.fc44.noarch curl-8.18.0~rc2-1.fc44.x86_64 cyrus-sasl-lib-2.1.28-33.fc44.x86_64 debugedit-5.2-3.fc44.x86_64 diffutils-3.12-3.fc43.x86_64 dwz-0.16-2.fc43.x86_64 ed-1.22.3-1.fc44.x86_64 efi-srpm-macros-6-5.fc44.noarch elfutils-0.194-2.fc44.x86_64 elfutils-debuginfod-client-0.194-2.fc44.x86_64 elfutils-default-yama-scope-0.194-2.fc44.noarch elfutils-libelf-0.194-2.fc44.x86_64 elfutils-libs-0.194-2.fc44.x86_64 fedora-gpg-keys-44-0.1.noarch fedora-release-44-0.10.noarch fedora-release-common-44-0.10.noarch fedora-release-identity-basic-44-0.10.noarch fedora-repos-44-0.1.noarch fedora-repos-rawhide-44-0.1.noarch file-5.46-8.fc44.x86_64 file-libs-5.46-8.fc44.x86_64 filesystem-3.18-50.fc43.x86_64 filesystem-srpm-macros-3.18-50.fc43.noarch findutils-4.10.0-6.fc43.x86_64 fonts-srpm-macros-5.0.0-1.fc44.noarch forge-srpm-macros-0.4.0-3.fc43.noarch fpc-srpm-macros-1.3-15.fc43.noarch gap-srpm-macros-2-1.fc44.noarch gawk-5.3.2-2.fc43.x86_64 gdb-minimal-16.3-6.fc44.x86_64 gdbm-libs-1.23-10.fc43.x86_64 ghc-srpm-macros-1.9.2-3.fc43.noarch glibc-2.42.9000-16.fc44.x86_64 glibc-common-2.42.9000-16.fc44.x86_64 glibc-gconv-extra-2.42.9000-16.fc44.x86_64 glibc-minimal-langpack-2.42.9000-16.fc44.x86_64 gmp-6.3.0-4.fc44.x86_64 gnat-srpm-macros-6-8.fc43.noarch gnulib-l10n-20241231-1.fc44.noarch gnupg2-2.4.8-4.fc43.x86_64 gnupg2-dirmngr-2.4.8-4.fc43.x86_64 gnupg2-gpg-agent-2.4.8-4.fc43.x86_64 gnupg2-gpgconf-2.4.8-4.fc43.x86_64 gnupg2-keyboxd-2.4.8-4.fc43.x86_64 gnupg2-verify-2.4.8-4.fc43.x86_64 gnutls-3.8.11-6.fc44.x86_64 go-srpm-macros-3.8.0-1.fc44.noarch gpg-pubkey-36f612dcf27f7d1a48a835e4dbfcf71c6d9f90a6-6786af3b gpg-pubkey-4f50a6114cd5c6976a7f1179655a4b02f577861e-6888bc98 gpg-pubkey-c6e7f081cf80e13146676e88829b606631645531-66b6dccf gpgverify-2.2-3.fc43.noarch grep-3.12-2.fc43.x86_64 gzip-1.14-1.fc44.x86_64 ima-evm-utils-libs-1.6.2-7.fc44.x86_64 info-7.2-7.fc44.x86_64 jansson-2.14-3.fc43.x86_64 java-srpm-macros-1-7.fc43.noarch json-c-0.18-7.fc43.x86_64 kernel-srpm-macros-1.0-27.fc43.noarch keyutils-libs-1.6.3-6.fc43.x86_64 krb5-libs-1.21.3-10.fc44.x86_64 libacl-2.3.2-4.fc43.x86_64 libarchive-3.8.4-1.fc44.x86_64 libassuan-2.5.7-4.fc43.x86_64 libatomic-16.0.0-0.2.fc44.x86_64 libattr-2.5.2-6.fc43.x86_64 libblkid-2.41.3-8.fc44.x86_64 libbrotli-1.2.0-1.fc44.x86_64 libcap-2.77-1.fc44.x86_64 libcap-ng-0.8.5-8.fc44.x86_64 libcom_err-1.47.3-3.fc44.x86_64 libcurl-8.18.0~rc2-1.fc44.x86_64 libeconf-0.7.9-2.fc43.x86_64 libevent-2.1.12-16.fc43.x86_64 libfdisk-2.41.3-8.fc44.x86_64 libffi-3.5.2-1.fc44.x86_64 libfsverity-1.6-3.fc43.x86_64 libgcc-16.0.0-0.2.fc44.x86_64 libgcrypt-1.11.2-1.fc44.x86_64 libgomp-16.0.0-0.2.fc44.x86_64 libgpg-error-1.58-1.fc44.x86_64 libidn2-2.3.8-2.fc43.x86_64 libksba-1.6.7-4.fc43.x86_64 liblastlog2-2.41.3-8.fc44.x86_64 libmount-2.41.3-8.fc44.x86_64 libnghttp2-1.68.0-2.fc44.x86_64 libnghttp3-1.13.1-1.fc44.x86_64 libpkgconf-2.3.0-3.fc43.x86_64 libpsl-0.21.5-6.fc43.x86_64 libselinux-3.9-5.fc44.x86_64 libselinux-utils-3.9-5.fc44.x86_64 libsemanage-3.9-4.fc44.x86_64 libsepol-3.9-2.fc43.x86_64 libsmartcols-2.41.3-8.fc44.x86_64 libssh-0.11.3-1.fc44.x86_64 libssh-config-0.11.3-1.fc44.noarch libstdc++-16.0.0-0.2.fc44.x86_64 libtasn1-4.20.0-2.fc43.x86_64 libtool-ltdl-2.5.4-8.fc44.x86_64 libunistring-1.1-10.fc43.x86_64 libusb1-1.0.29-4.fc44.x86_64 libuuid-2.41.3-8.fc44.x86_64 libverto-0.3.2-11.fc43.x86_64 libxcrypt-4.5.2-2.fc44.x86_64 libxml2-2.12.10-5.fc44.x86_64 libzstd-1.5.7-3.fc44.x86_64 linkdupes-0.7.2-2.fc44.x86_64 lua-libs-5.4.8-4.fc44.x86_64 lua-srpm-macros-1-16.fc43.noarch lz4-libs-1.10.0-3.fc43.x86_64 mpfr-4.2.2-2.fc43.x86_64 ncurses-base-6.5-8.20250614.fc44.noarch ncurses-libs-6.5-8.20250614.fc44.x86_64 nettle-3.10.1-2.fc43.x86_64 ngtcp2-1.18.0-1.fc44.x86_64 ngtcp2-crypto-ossl-1.18.0-1.fc44.x86_64 npth-1.8-3.fc43.x86_64 ocaml-srpm-macros-11-2.fc43.noarch openblas-srpm-macros-2-20.fc43.noarch openldap-2.6.10-4.fc44.x86_64 openssl-libs-3.5.4-1.fc44.x86_64 p11-kit-0.25.8-1.fc44.x86_64 p11-kit-trust-0.25.8-1.fc44.x86_64 package-notes-srpm-macros-0.5-14.fc43.noarch pam-libs-1.7.1-3.fc43.x86_64 patch-2.8-3.fc44.x86_64 pcre2-10.47-1.fc44.x86_64 pcre2-syntax-10.47-1.fc44.noarch perl-srpm-macros-1-60.fc43.noarch pkgconf-2.3.0-3.fc43.x86_64 pkgconf-m4-2.3.0-3.fc43.noarch pkgconf-pkg-config-2.3.0-3.fc43.x86_64 policycoreutils-3.9-5.fc44.x86_64 popt-1.19-9.fc43.x86_64 publicsuffix-list-dafsa-20250616-2.fc43.noarch pyproject-srpm-macros-1.18.6-1.fc44.noarch python-srpm-macros-3.14-9.fc44.noarch qt5-srpm-macros-5.15.18-1.fc44.noarch qt6-srpm-macros-6.10.1-1.fc44.noarch readline-8.3-2.fc43.x86_64 redhat-rpm-config-343-18.fc44.noarch rpm-6.0.1-1.fc44.x86_64 rpm-build-6.0.1-1.fc44.x86_64 rpm-build-libs-6.0.1-1.fc44.x86_64 rpm-libs-6.0.1-1.fc44.x86_64 rpm-plugin-selinux-6.0.1-1.fc44.x86_64 rpm-sequoia-1.10.0-1.fc44.x86_64 rpm-sign-libs-6.0.1-1.fc44.x86_64 rust-srpm-macros-28.4-1.fc44.noarch sed-4.9-6.fc44.x86_64 selinux-policy-42.19-1.fc44.noarch selinux-policy-targeted-42.19-1.fc44.noarch setup-2.15.0-27.fc44.noarch shadow-utils-4.18.0-7.fc44.x86_64 sqlite-libs-3.51.0-1.fc44.x86_64 systemd-libs-259-1.fc44.x86_64 systemd-standalone-sysusers-259-1.fc44.x86_64 tar-1.35-6.fc43.x86_64 tpm2-tss-4.1.3-8.fc43.x86_64 tree-sitter-srpm-macros-0.4.2-1.fc43.noarch unzip-6.0-68.fc44.x86_64 util-linux-2.41.3-8.fc44.x86_64 util-linux-core-2.41.3-8.fc44.x86_64 which-2.23-3.fc43.x86_64 xxhash-libs-0.8.3-3.fc43.x86_64 xz-5.8.1-4.fc44.x86_64 xz-libs-5.8.1-4.fc44.x86_64 zig-srpm-macros-1-5.fc43.noarch zip-3.0-44.fc43.x86_64 zlib-ng-compat-2.3.2-2.fc44.x86_64 zstd-1.5.7-3.fc44.x86_64 Start: buildsrpm Start: rpmbuild -bs Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1766188800 Wrote: /builddir/build/SRPMS/llama-cpp-b6153-1.fc44.src.rpm Finish: rpmbuild -bs INFO: chroot_scan: 1 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/fedora-rawhide-x86_64-1766268696.323769/root/var/log/dnf5.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names Finish: buildsrpm INFO: Done(/var/lib/copr-rpmbuild/workspace/workdir-uxi5ap5m/llama-cpp/llama-cpp.spec) Config(child) 0 minutes 33 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot INFO: Start(/var/lib/copr-rpmbuild/results/llama-cpp-b6153-1.fc44.src.rpm) Config(fedora-rawhide-x86_64) Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1766268696.323769/root. INFO: reusing tmpfs at /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1766268696.323769/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-1766268696.323769/root. INFO: calling preinit hooks INFO: enabled root cache Start: unpacking root cache Finish: unpacking root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-6.0.1-1.fc44.x86_64 rpm-sequoia-1.10.0-1.fc44.x86_64 dnf5-5.3.0.0-3.fc44.x86_64 dnf5-plugins-5.3.0.0-3.fc44.x86_64 Finish: chroot init Start: build phase for llama-cpp-b6153-1.fc44.src.rpm Start: build setup for llama-cpp-b6153-1.fc44.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1766188800 Wrote: /builddir/build/SRPMS/llama-cpp-b6153-1.fc44.src.rpm Updating and loading repositories: Copr repository 100% | 37.3 KiB/s | 1.5 KiB | 00m00s fedora 100% | 136.7 KiB/s | 26.5 KiB | 00m00s Repositories loaded. Package "curl-8.18.0~rc2-1.fc44.x86_64" is already installed. Package Arch Version Repository Size Installing: cmake x86_64 0:3.31.10-3.fc44 fedora 34.5 MiB gcc-c++ x86_64 0:16.0.0-0.2.fc44 copr_base 48.4 MiB git x86_64 0:2.52.0-1.fc44 fedora 56.4 KiB hipblas-devel x86_64 0:7.1.0-4.fc44 copr_base 2.4 MiB hipcc-libomp-devel x86_64 0:20-9.rocm7.1.1.fc44 copr_base 0.0 B langpacks-en noarch 0:4.2-5.fc43 fedora 400.0 B libcurl-devel x86_64 0:8.18.0~rc2-1.fc44 fedora 1.4 MiB openmpi x86_64 0:5.0.9-1.fc44 fedora 7.0 MiB pthreadpool-devel x86_64 0:0.0^git20230829.4fe0e1e-8.fc44 fedora 99.1 KiB rocblas-devel x86_64 0:7.1.1-3.fc44 copr_base 2.7 MiB rocm-comgr-devel x86_64 0:20-10.rocm7.1.1.fc44 copr_base 100.5 KiB rocm-hip-devel x86_64 0:7.1.1-1.fc44 copr_base 2.4 MiB rocm-rpm-macros noarch 0:7.1.0-7.fc44 fedora 18.9 KiB rocm-runtime-devel x86_64 0:7.1.1-3.fc44 copr_base 683.4 KiB wget2-wget x86_64 0:2.2.0-6.fc43 fedora 42.0 B xxd x86_64 2:9.1.1972-1.fc44 fedora 33.1 KiB Installing dependencies: abattis-cantarell-vf-fonts noarch 0:0.301-15.fc43 fedora 192.7 KiB annobin-docs noarch 0:13.03-1.fc44 fedora 99.2 KiB annobin-plugin-gcc x86_64 0:13.03-1.fc44 fedora 695.8 KiB brotli x86_64 0:1.2.0-1.fc44 fedora 33.6 KiB brotli-devel x86_64 0:1.2.0-1.fc44 fedora 65.9 KiB clang-resource-filesystem x86_64 0:21.1.7-1.fc44 fedora 15.3 KiB cmake-data noarch 0:3.31.10-3.fc44 fedora 8.4 MiB cmake-filesystem x86_64 0:3.31.10-3.fc44 fedora 0.0 B cmake-rpm-macros noarch 0:3.31.10-3.fc44 fedora 8.2 KiB cpp x86_64 0:16.0.0-0.2.fc44 copr_base 43.7 MiB default-fonts-core-sans noarch 0:4.2-5.fc43 fedora 11.9 KiB dns-root-data noarch 0:2025080400-2.fc44 fedora 12.0 KiB emacs-filesystem x86_64 1:30.2-1.fc44 fedora 0.0 B environment-modules x86_64 0:5.6.1-1.fc44 fedora 1.9 MiB expat x86_64 0:2.7.3-1.fc44 fedora 301.1 KiB fonts-filesystem noarch 1:5.0.0-1.fc44 fedora 0.0 B gcc x86_64 0:16.0.0-0.2.fc44 copr_base 126.0 MiB gcc-plugin-annobin x86_64 0:16.0.0-0.2.fc44 copr_base 57.2 KiB git-core x86_64 0:2.52.0-1.fc44 fedora 24.0 MiB git-core-doc noarch 0:2.52.0-1.fc44 fedora 18.4 MiB glibc-devel x86_64 0:2.42.9000-16.fc44 fedora 2.3 MiB gnutls-dane x86_64 0:3.8.11-6.fc44 fedora 60.9 KiB google-noto-fonts-common noarch 0:20251201-1.fc44 fedora 17.7 KiB google-noto-sans-mono-vf-fonts noarch 0:20251201-1.fc44 fedora 561.2 KiB google-noto-sans-vf-fonts noarch 0:20251201-1.fc44 fedora 1.4 MiB google-noto-serif-vf-fonts noarch 0:20251201-1.fc44 fedora 1.6 MiB gpgme x86_64 0:1.24.3-6.fc44 fedora 587.9 KiB groff-base x86_64 0:1.23.0-11.fc44 fedora 3.8 MiB hipblas x86_64 0:7.1.0-4.fc44 copr_base 803.6 KiB hipblas-common-devel noarch 0:7.1.0-1.fc44 copr_base 16.8 KiB hipcc x86_64 0:20-9.rocm7.1.1.fc44 copr_base 634.5 KiB hiredis x86_64 0:1.2.0-7.fc43 fedora 105.9 KiB hwdata noarch 0:0.402-1.fc44 fedora 9.7 MiB hwloc-libs x86_64 0:2.12.0-2.fc43 fedora 2.9 MiB jsoncpp x86_64 0:1.9.6-2.fc43 fedora 257.6 KiB kernel-headers x86_64 0:6.19.0-0.rc1.15.fc44 fedora 6.9 MiB keyutils-libs-devel x86_64 0:1.6.3-6.fc43 fedora 48.2 KiB krb5-devel x86_64 0:1.21.3-10.fc44 fedora 705.9 KiB langpacks-core-en noarch 0:4.2-5.fc43 fedora 398.0 B langpacks-fonts-en noarch 0:4.2-5.fc43 fedora 341.0 B less x86_64 0:685-6.fc44 fedora 448.6 KiB libcbor x86_64 0:0.13.0-1.fc44 fedora 79.4 KiB libcom_err-devel x86_64 0:1.47.3-3.fc44 fedora 16.7 KiB libdrm x86_64 0:2.4.128-3.fc44 fedora 399.9 KiB libedit x86_64 0:3.1-57.20251016cvs.fc44 fedora 240.2 KiB libfabric x86_64 0:2.3.1-1.fc44 fedora 9.0 MiB libfido2 x86_64 0:1.16.0-4.fc44 fedora 238.5 KiB libgfortran x86_64 0:16.0.0-0.2.fc44 copr_base 3.4 MiB libibverbs x86_64 0:60.0-1.fc44 fedora 1.2 MiB libidn2-devel x86_64 0:2.3.8-2.fc43 fedora 149.1 KiB libkadm5 x86_64 0:1.21.3-10.fc44 fedora 213.9 KiB libmpc x86_64 0:1.3.1-8.fc43 fedora 160.6 KiB libnghttp2-devel x86_64 0:1.68.0-2.fc44 fedora 288.0 KiB libnghttp3-devel x86_64 0:1.13.1-1.fc44 fedora 105.0 KiB libnl3 x86_64 0:3.12.0-2.fc44 fedora 1.0 MiB libomp x86_64 0:21.1.7-1.fc44 fedora 2.8 MiB libomp-devel x86_64 0:21.1.7-1.fc44 fedora 1.5 MiB libpciaccess x86_64 0:0.16-16.fc43 fedora 44.5 KiB libpipeline x86_64 0:1.5.8-3.fc43 fedora 145.1 KiB libpsl-devel x86_64 0:0.21.5-6.fc43 fedora 110.2 KiB libpsm2 x86_64 0:12.0.1-3.fc43 fedora 442.3 KiB libquadmath x86_64 0:16.0.0-0.2.fc44 copr_base 325.9 KiB librdmacm x86_64 0:60.0-1.fc44 fedora 150.1 KiB libselinux-devel x86_64 0:3.9-5.fc44 fedora 127.3 KiB libsepol-devel x86_64 0:3.9-2.fc43 fedora 121.4 KiB libssh-devel x86_64 0:0.11.3-1.fc44 fedora 178.0 KiB libstdc++-devel x86_64 0:16.0.0-0.2.fc44 copr_base 38.7 MiB libtommath x86_64 0:1.3.1~rc1-6.fc43 fedora 126.4 KiB libuv x86_64 1:1.51.0-2.fc43 fedora 570.2 KiB libverto-devel x86_64 0:0.3.2-11.fc43 fedora 25.7 KiB libxcrypt-devel x86_64 0:4.5.2-2.fc44 fedora 31.0 KiB llvm-filesystem x86_64 0:21.1.7-1.fc44 fedora 0.0 B llvm-libs x86_64 0:21.1.7-1.fc44 fedora 138.6 MiB make x86_64 1:4.4.1-11.fc43 fedora 1.8 MiB man-db x86_64 0:2.13.1-2.fc43 fedora 2.9 MiB mpdecimal x86_64 0:4.0.1-2.fc43 fedora 217.2 KiB munge-libs x86_64 0:0.5.16-6.fc43 fedora 28.0 KiB ncurses x86_64 0:6.5-8.20250614.fc44 fedora 609.8 KiB ngtcp2-crypto-ossl-devel x86_64 0:1.18.0-1.fc44 fedora 7.5 KiB ngtcp2-devel x86_64 0:1.18.0-1.fc44 fedora 296.9 KiB numactl-libs x86_64 0:2.0.19-3.fc43 fedora 56.9 KiB openssh x86_64 0:10.2p1-1.fc44 fedora 1.4 MiB openssh-clients x86_64 0:10.2p1-1.fc44 fedora 2.6 MiB openssl-devel x86_64 1:3.5.4-1.fc44 fedora 4.6 MiB orangefs x86_64 0:2.10.1-1.fc44 fedora 1.7 MiB pcre2-devel x86_64 0:10.47-1.fc44 fedora 2.1 MiB pcre2-utf16 x86_64 0:10.47-1.fc44 fedora 639.2 KiB pcre2-utf32 x86_64 0:10.47-1.fc44 fedora 611.1 KiB perl-AutoLoader noarch 0:5.74-520.fc43 fedora 20.6 KiB perl-B x86_64 0:1.89-520.fc43 fedora 501.3 KiB perl-Carp noarch 0:1.54-520.fc43 fedora 46.6 KiB perl-Class-Struct noarch 0:0.68-520.fc43 fedora 25.4 KiB perl-Data-Dumper x86_64 0:2.191-521.fc43 fedora 115.6 KiB perl-Digest noarch 0:1.20-520.fc43 fedora 35.3 KiB perl-Digest-MD5 x86_64 0:2.59-520.fc43 fedora 59.7 KiB perl-DynaLoader x86_64 0:1.57-520.fc43 fedora 32.1 KiB perl-Encode x86_64 4:3.21-520.fc43 fedora 4.7 MiB perl-Errno x86_64 0:1.38-520.fc43 fedora 8.4 KiB perl-Error noarch 1:0.17030-2.fc43 fedora 76.7 KiB perl-Exporter noarch 0:5.79-520.fc43 fedora 54.3 KiB perl-Fcntl x86_64 0:1.20-520.fc43 fedora 48.8 KiB perl-File-Basename noarch 0:2.86-520.fc43 fedora 14.0 KiB perl-File-Copy noarch 0:2.41-520.fc43 fedora 19.7 KiB perl-File-Path noarch 0:2.18-521.fc44 fedora 63.5 KiB perl-File-Temp noarch 1:0.231.200-1.fc44 fedora 163.7 KiB perl-File-Which noarch 0:1.27-14.fc43 fedora 30.4 KiB perl-File-stat noarch 0:1.14-520.fc43 fedora 12.5 KiB perl-FileHandle noarch 0:2.05-520.fc43 fedora 9.4 KiB perl-Getopt-Long noarch 1:2.58-520.fc43 fedora 144.5 KiB perl-Getopt-Std noarch 0:1.14-520.fc43 fedora 11.2 KiB perl-Git noarch 0:2.52.0-1.fc44 fedora 64.4 KiB perl-HTTP-Tiny noarch 0:0.090-521.fc43 fedora 154.4 KiB perl-IO x86_64 0:1.55-520.fc43 fedora 147.4 KiB perl-IO-Socket-IP noarch 0:0.43-521.fc43 fedora 100.3 KiB perl-IO-Socket-SSL noarch 0:2.095-2.fc43 fedora 714.5 KiB perl-IPC-Open3 noarch 0:1.24-520.fc43 fedora 27.7 KiB perl-MIME-Base32 noarch 0:1.303-24.fc43 fedora 30.7 KiB perl-MIME-Base64 x86_64 0:3.16-520.fc43 fedora 42.0 KiB perl-Net-SSLeay x86_64 0:1.94-11.fc43 fedora 1.3 MiB perl-POSIX x86_64 0:2.23-520.fc43 fedora 231.4 KiB perl-PathTools x86_64 0:3.94-520.fc43 fedora 180.0 KiB perl-Pod-Escapes noarch 1:1.07-520.fc43 fedora 24.9 KiB perl-Pod-Perldoc noarch 0:3.28.01-521.fc43 fedora 163.7 KiB perl-Pod-Simple noarch 1:3.47-3.fc43 fedora 565.3 KiB perl-Pod-Usage noarch 4:2.05-520.fc43 fedora 86.3 KiB perl-Scalar-List-Utils x86_64 5:1.70-1.fc43 fedora 144.9 KiB perl-SelectSaver noarch 0:1.02-520.fc43 fedora 2.2 KiB perl-Socket x86_64 4:2.040-2.fc43 fedora 120.3 KiB perl-Storable x86_64 1:3.37-521.fc43 fedora 231.2 KiB perl-Symbol noarch 0:1.09-520.fc43 fedora 6.8 KiB perl-Term-ANSIColor noarch 0:5.01-521.fc43 fedora 97.5 KiB perl-Term-Cap noarch 0:1.18-520.fc43 fedora 29.3 KiB perl-TermReadKey x86_64 0:2.38-26.fc43 fedora 64.0 KiB perl-Text-ParseWords noarch 0:3.31-520.fc43 fedora 13.6 KiB perl-Text-Tabs+Wrap noarch 0:2024.001-520.fc43 fedora 22.6 KiB perl-Time-Local noarch 2:1.350-520.fc43 fedora 69.0 KiB perl-URI noarch 0:5.34-2.fc44 fedora 268.0 KiB perl-base noarch 0:2.27-520.fc43 fedora 12.6 KiB perl-constant noarch 0:1.33-521.fc43 fedora 26.2 KiB perl-if noarch 0:0.61.000-520.fc43 fedora 5.8 KiB perl-interpreter x86_64 4:5.42.0-520.fc43 fedora 118.6 KiB perl-lib x86_64 0:0.65-520.fc43 fedora 8.5 KiB perl-libnet noarch 0:3.15-521.fc43 fedora 289.4 KiB perl-libs x86_64 4:5.42.0-520.fc43 fedora 11.5 MiB perl-locale noarch 0:1.13-520.fc43 fedora 6.1 KiB perl-mro x86_64 0:1.29-520.fc43 fedora 41.6 KiB perl-overload noarch 0:1.40-520.fc43 fedora 71.6 KiB perl-overloading noarch 0:0.02-520.fc43 fedora 4.9 KiB perl-parent noarch 1:0.244-520.fc43 fedora 10.3 KiB perl-podlators noarch 1:6.0.2-520.fc43 fedora 317.5 KiB perl-vars noarch 0:1.05-520.fc43 fedora 3.9 KiB pmix x86_64 0:5.0.7-2.fc43 fedora 2.2 MiB procps-ng x86_64 0:4.0.4-9.fc44 fedora 1.0 MiB protobuf-c x86_64 0:1.5.1-2.fc43 fedora 49.8 KiB prrte x86_64 0:3.0.6-8.fc43 fedora 158.3 KiB prrte-libs x86_64 0:3.0.6-8.fc43 fedora 1.7 MiB pthreadpool x86_64 0:0.0^git20230829.4fe0e1e-8.fc44 fedora 113.5 KiB publicsuffix-list noarch 0:20250616-2.fc43 fedora 332.8 KiB python-pip-wheel noarch 0:25.3-1.fc44 fedora 1.2 MiB python3 x86_64 0:3.14.2-1.fc44 fedora 28.9 KiB python3-libs x86_64 0:3.14.2-1.fc44 fedora 43.1 MiB rdma-core-common noarch 0:60.0-1.fc44 fedora 21.9 KiB rhash x86_64 0:1.4.5-3.fc43 fedora 351.1 KiB rocblas x86_64 0:7.1.1-3.fc44 copr_base 973.4 MiB rocm-clang x86_64 0:20-10.rocm7.1.1.fc44 copr_base 68.5 MiB rocm-clang-devel x86_64 0:20-10.rocm7.1.1.fc44 copr_base 26.1 MiB rocm-clang-libs x86_64 0:20-10.rocm7.1.1.fc44 copr_base 94.1 MiB rocm-clang-runtime-devel x86_64 0:20-10.rocm7.1.1.fc44 copr_base 8.4 MiB rocm-comgr x86_64 0:20-10.rocm7.1.1.fc44 copr_base 126.3 MiB rocm-device-libs x86_64 0:20-9.rocm7.1.1.fc44 copr_base 3.2 MiB rocm-hip x86_64 0:7.1.1-1.fc44 copr_base 27.0 MiB rocm-libc++ x86_64 0:20-10.rocm7.1.1.fc44 copr_base 1.3 MiB rocm-libc++-devel x86_64 0:20-10.rocm7.1.1.fc44 copr_base 15.0 MiB rocm-lld x86_64 0:20-10.rocm7.1.1.fc44 copr_base 5.9 MiB rocm-llvm x86_64 0:20-10.rocm7.1.1.fc44 copr_base 52.5 MiB rocm-llvm-devel x86_64 0:20-10.rocm7.1.1.fc44 copr_base 28.3 MiB rocm-llvm-filesystem x86_64 0:20-10.rocm7.1.1.fc44 copr_base 0.0 B rocm-llvm-libs x86_64 0:20-10.rocm7.1.1.fc44 copr_base 91.6 MiB rocm-llvm-static x86_64 0:20-10.rocm7.1.1.fc44 copr_base 1.9 GiB rocm-runtime x86_64 0:7.1.1-3.fc44 copr_base 3.2 MiB rocsolver x86_64 0:7.1.0-3.fc44 copr_base 936.6 MiB tcl x86_64 1:9.0.2-1.fc44 fedora 4.3 MiB tzdata noarch 0:2025c-1.fc44 fedora 1.2 MiB ucx x86_64 0:1.19.0-1.fc44 copr_base 2.4 MiB unbound-libs x86_64 0:1.24.2-1.fc44 fedora 1.5 MiB vim-filesystem noarch 2:9.1.1972-1.fc44 fedora 40.0 B wget2 x86_64 0:2.2.0-6.fc43 fedora 1.0 MiB wget2-libs x86_64 0:2.2.0-6.fc43 fedora 365.6 KiB zlib-ng-compat-devel x86_64 0:2.3.2-2.fc44 fedora 107.0 KiB Transaction Summary: Installing: 205 packages Total size of inbound packages is 2 GiB. Need to download 2 GiB. After this operation, 5 GiB extra will be used (install 5 GiB, remove 0 B). [ 1/205] langpacks-en-0:4.2-5.fc43.noa 100% | 132.2 KiB/s | 9.4 KiB | 00m00s [ 2/205] git-0:2.52.0-1.fc44.x86_64 100% | 383.6 KiB/s | 41.0 KiB | 00m00s [ 3/205] xxd-2:9.1.1972-1.fc44.x86_64 100% | 758.6 KiB/s | 31.1 KiB | 00m00s [ 4/205] openmpi-0:5.0.9-1.fc44.x86_64 100% | 7.5 MiB/s | 2.1 MiB | 00m00s [ 5/205] hipblas-devel-0:7.1.0-4.fc44. 100% | 2.1 MiB/s | 83.0 KiB | 00m00s [ 6/205] cmake-0:3.31.10-3.fc44.x86_64 100% | 39.6 MiB/s | 12.3 MiB | 00m00s [ 7/205] hipcc-libomp-devel-0:20-9.roc 100% | 123.2 KiB/s | 14.7 KiB | 00m00s [ 8/205] libcurl-devel-0:8.18.0~rc2-1. 100% | 35.2 MiB/s | 937.7 KiB | 00m00s [ 9/205] pthreadpool-devel-0:0.0^git20 100% | 630.9 KiB/s | 14.5 KiB | 00m00s [ 10/205] rocblas-devel-0:7.1.1-3.fc44. 100% | 1.2 MiB/s | 102.3 KiB | 00m00s [ 11/205] rocm-comgr-devel-0:20-10.rocm 100% | 158.3 KiB/s | 32.8 KiB | 00m00s [ 12/205] rocm-rpm-macros-0:7.1.0-7.fc4 100% | 831.6 KiB/s | 16.6 KiB | 00m00s [ 13/205] rocm-hip-devel-0:7.1.1-1.fc44 100% | 1.3 MiB/s | 262.6 KiB | 00m00s [ 14/205] git-core-0:2.52.0-1.fc44.x86_ 100% | 93.7 MiB/s | 5.2 MiB | 00m00s [ 15/205] git-core-doc-0:2.52.0-1.fc44. 100% | 73.3 MiB/s | 3.1 MiB | 00m00s [ 16/205] rocm-runtime-devel-0:7.1.1-3. 100% | 800.0 KiB/s | 116.8 KiB | 00m00s [ 17/205] perl-File-Basename-0:2.86-520 100% | 858.4 KiB/s | 17.2 KiB | 00m00s [ 18/205] perl-Getopt-Long-1:2.58-520.f 100% | 1.8 MiB/s | 63.6 KiB | 00m00s [ 19/205] perl-Git-0:2.52.0-1.fc44.noar 100% | 1.9 MiB/s | 38.1 KiB | 00m00s [ 20/205] gcc-c++-0:16.0.0-0.2.fc44.x86 100% | 22.3 MiB/s | 17.3 MiB | 00m01s [ 21/205] perl-IPC-Open3-0:1.24-520.fc4 100% | 1.0 MiB/s | 23.9 KiB | 00m00s [ 22/205] perl-PathTools-0:3.94-520.fc4 100% | 4.1 MiB/s | 87.2 KiB | 00m00s [ 23/205] perl-TermReadKey-0:2.38-26.fc 100% | 1.7 MiB/s | 35.2 KiB | 00m00s [ 24/205] perl-lib-0:0.65-520.fc43.x86_ 100% | 712.0 KiB/s | 15.0 KiB | 00m00s [ 25/205] perl-interpreter-4:5.42.0-520 100% | 2.1 MiB/s | 72.4 KiB | 00m00s [ 26/205] langpacks-core-en-0:4.2-5.fc4 100% | 470.1 KiB/s | 9.4 KiB | 00m00s [ 27/205] langpacks-fonts-en-0:4.2-5.fc 100% | 485.6 KiB/s | 9.7 KiB | 00m00s [ 28/205] libpsm2-0:12.0.1-3.fc43.x86_6 100% | 8.9 MiB/s | 201.3 KiB | 00m00s [ 29/205] openssh-clients-0:10.2p1-1.fc 100% | 30.6 MiB/s | 750.9 KiB | 00m00s [ 30/205] libfabric-0:2.3.1-1.fc44.x86_ 100% | 35.9 MiB/s | 1.8 MiB | 00m00s [ 31/205] orangefs-0:2.10.1-1.fc44.x86_ 100% | 24.1 MiB/s | 592.7 KiB | 00m00s [ 32/205] pmix-0:5.0.7-2.fc43.x86_64 100% | 28.3 MiB/s | 725.1 KiB | 00m00s [ 33/205] prrte-0:3.0.6-8.fc43.x86_64 100% | 2.6 MiB/s | 56.0 KiB | 00m00s [ 34/205] cmake-filesystem-0:3.31.10-3. 100% | 693.5 KiB/s | 13.9 KiB | 00m00s [ 35/205] cmake-data-0:3.31.10-3.fc44.n 100% | 53.8 MiB/s | 2.5 MiB | 00m00s [ 36/205] hwloc-libs-0:2.12.0-2.fc43.x8 100% | 14.4 MiB/s | 2.1 MiB | 00m00s [ 37/205] expat-0:2.7.3-1.fc44.x86_64 100% | 5.1 MiB/s | 119.9 KiB | 00m00s [ 38/205] jsoncpp-0:1.9.6-2.fc43.x86_64 100% | 4.7 MiB/s | 101.1 KiB | 00m00s [ 39/205] libuv-1:1.51.0-2.fc43.x86_64 100% | 11.8 MiB/s | 266.1 KiB | 00m00s [ 40/205] make-1:4.4.1-11.fc43.x86_64 100% | 19.7 MiB/s | 585.2 KiB | 00m00s [ 41/205] rhash-0:1.4.5-3.fc43.x86_64 100% | 8.4 MiB/s | 197.9 KiB | 00m00s [ 42/205] libmpc-0:1.3.1-8.fc43.x86_64 100% | 3.1 MiB/s | 70.4 KiB | 00m00s [ 43/205] pthreadpool-0:0.0^git20230829 100% | 2.2 MiB/s | 47.4 KiB | 00m00s [ 44/205] libstdc++-devel-0:16.0.0-0.2. 100% | 59.1 MiB/s | 5.4 MiB | 00m00s [ 45/205] rocblas-0:7.1.1-3.fc44.x86_64 100% | 260.2 MiB/s | 275.5 MiB | 00m01s [ 46/205] perl-File-Copy-0:2.41-520.fc4 100% | 1.0 MiB/s | 20.1 KiB | 00m00s [ 47/205] perl-File-Which-0:1.27-14.fc4 100% | 1.0 MiB/s | 21.4 KiB | 00m00s [ 48/205] perl-Getopt-Std-0:1.14-520.fc 100% | 785.3 KiB/s | 15.7 KiB | 00m00s [ 49/205] perl-Scalar-List-Utils-5:1.70 100% | 3.5 MiB/s | 75.0 KiB | 00m00s [ 50/205] perl-URI-0:5.34-2.fc44.noarch 100% | 6.9 MiB/s | 149.4 KiB | 00m00s [ 51/205] environment-modules-0:5.6.1-1 100% | 30.1 MiB/s | 801.0 KiB | 00m00s [ 52/205] less-0:685-6.fc44.x86_64 100% | 7.9 MiB/s | 210.6 KiB | 00m00s [ 53/205] perl-Carp-0:1.54-520.fc43.noa 100% | 1.3 MiB/s | 28.7 KiB | 00m00s [ 54/205] perl-Exporter-0:5.79-520.fc43 100% | 1.5 MiB/s | 30.9 KiB | 00m00s [ 55/205] perl-Pod-Usage-4:2.05-520.fc4 100% | 2.0 MiB/s | 40.5 KiB | 00m00s [ 56/205] perl-Text-ParseWords-0:3.31-5 100% | 817.3 KiB/s | 16.3 KiB | 00m00s [ 57/205] perl-base-0:2.27-520.fc43.noa 100% | 811.2 KiB/s | 16.2 KiB | 00m00s [ 58/205] perl-constant-0:1.33-521.fc43 100% | 1.1 MiB/s | 22.8 KiB | 00m00s [ 59/205] perl-overload-0:1.40-520.fc43 100% | 2.2 MiB/s | 45.6 KiB | 00m00s [ 60/205] perl-Error-1:0.17030-2.fc43.n 100% | 1.9 MiB/s | 40.2 KiB | 00m00s [ 61/205] perl-Fcntl-0:1.20-520.fc43.x8 100% | 1.5 MiB/s | 29.8 KiB | 00m00s [ 62/205] perl-IO-0:1.55-520.fc43.x86_6 100% | 3.5 MiB/s | 82.2 KiB | 00m00s [ 63/205] perl-POSIX-0:2.23-520.fc43.x8 100% | 3.8 MiB/s | 97.8 KiB | 00m00s [ 64/205] perl-Symbol-0:1.09-520.fc43.n 100% | 645.6 KiB/s | 14.2 KiB | 00m00s [ 65/205] rocm-comgr-0:20-10.rocm7.1.1. 100% | 21.3 MiB/s | 31.3 MiB | 00m01s [ 66/205] perl-Errno-0:1.38-520.fc43.x8 100% | 427.0 KiB/s | 14.9 KiB | 00m00s [ 67/205] perl-libs-4:5.42.0-520.fc43.x 100% | 60.9 MiB/s | 2.6 MiB | 00m00s [ 68/205] perl-DynaLoader-0:1.57-520.fc 100% | 619.4 KiB/s | 26.0 KiB | 00m00s [ 69/205] default-fonts-core-sans-0:4.2 100% | 1.5 MiB/s | 29.9 KiB | 00m00s [ 70/205] perl-vars-0:1.05-520.fc43.noa 100% | 618.5 KiB/s | 13.0 KiB | 00m00s [ 71/205] google-noto-sans-mono-vf-font 100% | 11.8 MiB/s | 277.4 KiB | 00m00s [ 72/205] google-noto-serif-vf-fonts-0: 100% | 25.0 MiB/s | 666.0 KiB | 00m00s [ 73/205] libibverbs-0:60.0-1.fc44.x86_ 100% | 17.7 MiB/s | 451.9 KiB | 00m00s [ 74/205] libnl3-0:3.12.0-2.fc44.x86_64 100% | 14.3 MiB/s | 366.0 KiB | 00m00s [ 75/205] librdmacm-0:60.0-1.fc44.x86_6 100% | 3.5 MiB/s | 74.7 KiB | 00m00s [ 76/205] numactl-libs-0:2.0.19-3.fc43. 100% | 1.5 MiB/s | 31.1 KiB | 00m00s [ 77/205] libedit-0:3.1-57.20251016cvs. 100% | 4.9 MiB/s | 105.0 KiB | 00m00s [ 78/205] libfido2-0:1.16.0-4.fc44.x86_ 100% | 4.6 MiB/s | 98.5 KiB | 00m00s [ 79/205] munge-libs-0:0.5.16-6.fc43.x8 100% | 973.1 KiB/s | 20.4 KiB | 00m00s [ 80/205] openssh-0:10.2p1-1.fc44.x86_6 100% | 13.7 MiB/s | 349.9 KiB | 00m00s [ 81/205] prrte-libs-0:3.0.6-8.fc43.x86 100% | 22.3 MiB/s | 547.0 KiB | 00m00s [ 82/205] emacs-filesystem-1:30.2-1.fc4 100% | 329.6 KiB/s | 7.9 KiB | 00m00s [ 83/205] vim-filesystem-2:9.1.1972-1.f 100% | 732.4 KiB/s | 15.4 KiB | 00m00s [ 84/205] perl-Data-Dumper-0:2.191-521. 100% | 2.7 MiB/s | 56.3 KiB | 00m00s [ 85/205] perl-MIME-Base32-0:1.303-24.f 100% | 1.0 MiB/s | 20.4 KiB | 00m00s [ 86/205] perl-MIME-Base64-0:3.16-520.f 100% | 1.3 MiB/s | 29.7 KiB | 00m00s [ 87/205] perl-libnet-0:3.15-521.fc43.n 100% | 5.7 MiB/s | 128.3 KiB | 00m00s [ 88/205] perl-parent-1:0.244-520.fc43. 100% | 740.2 KiB/s | 14.8 KiB | 00m00s [ 89/205] man-db-0:2.13.1-2.fc43.x86_64 100% | 46.9 MiB/s | 1.4 MiB | 00m00s [ 90/205] perl-Pod-Perldoc-0:3.28.01-52 100% | 3.9 MiB/s | 84.3 KiB | 00m00s [ 91/205] perl-podlators-1:6.0.2-520.fc 100% | 6.3 MiB/s | 128.4 KiB | 00m00s [ 92/205] perl-mro-0:1.29-520.fc43.x86_ 100% | 1.5 MiB/s | 29.9 KiB | 00m00s [ 93/205] cpp-0:16.0.0-0.2.fc44.x86_64 100% | 59.3 MiB/s | 14.6 MiB | 00m00s [ 94/205] perl-overloading-0:0.02-520.f 100% | 516.4 KiB/s | 12.9 KiB | 00m00s [ 95/205] perl-SelectSaver-0:1.02-520.f 100% | 586.1 KiB/s | 11.7 KiB | 00m00s [ 96/205] perl-File-stat-0:1.14-520.fc4 100% | 853.2 KiB/s | 17.1 KiB | 00m00s [ 97/205] perl-Socket-4:2.040-2.fc43.x8 100% | 2.7 MiB/s | 54.9 KiB | 00m00s [ 98/205] perl-locale-0:1.13-520.fc43.n 100% | 675.2 KiB/s | 13.5 KiB | 00m00s [ 99/205] gcc-0:16.0.0-0.2.fc44.x86_64 100% | 20.6 MiB/s | 43.6 MiB | 00m02s [100/205] abattis-cantarell-vf-fonts-0: 100% | 2.1 MiB/s | 120.1 KiB | 00m00s [101/205] google-noto-sans-vf-fonts-0:2 100% | 10.4 MiB/s | 614.9 KiB | 00m00s [102/205] fonts-filesystem-1:5.0.0-1.fc 100% | 440.4 KiB/s | 8.8 KiB | 00m00s [103/205] rdma-core-common-0:60.0-1.fc4 100% | 835.5 KiB/s | 16.7 KiB | 00m00s [104/205] libcbor-0:0.13.0-1.fc44.x86_6 100% | 1.6 MiB/s | 34.5 KiB | 00m00s [105/205] perl-B-0:1.89-520.fc43.x86_64 100% | 8.3 MiB/s | 177.7 KiB | 00m00s [106/205] perl-Digest-MD5-0:2.59-520.fc 100% | 1.7 MiB/s | 35.8 KiB | 00m00s [107/205] perl-FileHandle-0:2.05-520.fc 100% | 775.0 KiB/s | 15.5 KiB | 00m00s [108/205] perl-IO-Socket-IP-0:0.43-521. 100% | 1.8 MiB/s | 42.1 KiB | 00m00s [109/205] perl-Time-Local-2:1.350-520.f 100% | 1.5 MiB/s | 34.4 KiB | 00m00s [110/205] google-noto-fonts-common-0:20 100% | 204.7 KiB/s | 17.6 KiB | 00m00s [111/205] libpipeline-0:1.5.8-3.fc43.x8 100% | 2.9 MiB/s | 59.9 KiB | 00m00s [112/205] groff-base-0:1.23.0-11.fc44.x 100% | 39.3 MiB/s | 1.1 MiB | 00m00s [113/205] perl-HTTP-Tiny-0:0.090-521.fc 100% | 2.8 MiB/s | 56.3 KiB | 00m00s [114/205] perl-Pod-Simple-1:3.47-3.fc43 100% | 10.2 MiB/s | 219.9 KiB | 00m00s [115/205] perl-File-Temp-1:0.231.200-1. 100% | 1.1 MiB/s | 59.5 KiB | 00m00s [116/205] perl-Term-ANSIColor-0:5.01-52 100% | 2.3 MiB/s | 47.6 KiB | 00m00s [117/205] perl-Term-Cap-0:1.18-520.fc43 100% | 1.1 MiB/s | 21.9 KiB | 00m00s [118/205] perl-Class-Struct-0:0.68-520. 100% | 919.8 KiB/s | 22.1 KiB | 00m00s [119/205] perl-if-0:0.61.000-520.fc43.n 100% | 700.2 KiB/s | 14.0 KiB | 00m00s [120/205] perl-Digest-0:1.20-520.fc43.n 100% | 1.2 MiB/s | 24.8 KiB | 00m00s [121/205] perl-IO-Socket-SSL-0:2.095-2. 100% | 10.3 MiB/s | 231.5 KiB | 00m00s [122/205] perl-File-Path-0:2.18-521.fc4 100% | 1.3 MiB/s | 35.0 KiB | 00m00s [123/205] perl-Net-SSLeay-0:1.94-11.fc4 100% | 15.9 MiB/s | 374.8 KiB | 00m00s [124/205] perl-Pod-Escapes-1:1.07-520.f 100% | 989.0 KiB/s | 19.8 KiB | 00m00s [125/205] perl-Text-Tabs+Wrap-0:2024.00 100% | 901.4 KiB/s | 21.6 KiB | 00m00s [126/205] ncurses-0:6.5-8.20250614.fc44 100% | 18.9 MiB/s | 426.2 KiB | 00m00s [127/205] perl-AutoLoader-0:5.74-520.fc 100% | 1.0 MiB/s | 21.2 KiB | 00m00s [128/205] wget2-wget-0:2.2.0-6.fc43.x86 100% | 442.4 KiB/s | 9.7 KiB | 00m00s [129/205] wget2-0:2.2.0-6.fc43.x86_64 100% | 11.4 MiB/s | 279.9 KiB | 00m00s [130/205] gpgme-0:1.24.3-6.fc44.x86_64 100% | 9.3 MiB/s | 218.5 KiB | 00m00s [131/205] gnutls-dane-0:3.8.11-6.fc44.x 100% | 1.7 MiB/s | 39.3 KiB | 00m00s [132/205] unbound-libs-0:1.24.2-1.fc44. 100% | 22.5 MiB/s | 575.8 KiB | 00m00s [133/205] dns-root-data-0:2025080400-2. 100% | 719.9 KiB/s | 14.4 KiB | 00m00s [134/205] wget2-libs-0:2.2.0-6.fc43.x86 100% | 2.7 MiB/s | 147.7 KiB | 00m00s [135/205] hiredis-0:1.2.0-7.fc43.x86_64 100% | 2.5 MiB/s | 50.3 KiB | 00m00s [136/205] protobuf-c-0:1.5.1-2.fc43.x86 100% | 1.6 MiB/s | 32.7 KiB | 00m00s [137/205] mpdecimal-0:4.0.1-2.fc43.x86_ 100% | 4.5 MiB/s | 97.1 KiB | 00m00s [138/205] python-pip-wheel-0:25.3-1.fc4 100% | 43.5 MiB/s | 1.1 MiB | 00m00s [139/205] tzdata-0:2025c-1.fc44.noarch 100% | 24.9 MiB/s | 714.4 KiB | 00m00s [140/205] perl-Encode-4:3.21-520.fc43.x 100% | 37.6 MiB/s | 1.1 MiB | 00m00s [141/205] perl-Storable-1:3.37-521.fc43 100% | 4.6 MiB/s | 98.5 KiB | 00m00s [142/205] libquadmath-0:16.0.0-0.2.fc44 100% | 3.8 MiB/s | 181.1 KiB | 00m00s [143/205] libgfortran-0:16.0.0-0.2.fc44 100% | 8.4 MiB/s | 975.4 KiB | 00m00s [144/205] brotli-devel-0:1.2.0-1.fc44.x 100% | 1.3 MiB/s | 34.4 KiB | 00m00s [145/205] python3-libs-0:3.14.2-1.fc44. 100% | 42.0 MiB/s | 9.8 MiB | 00m00s [146/205] brotli-0:1.2.0-1.fc44.x86_64 100% | 1.1 MiB/s | 23.8 KiB | 00m00s [147/205] krb5-devel-0:1.21.3-10.fc44.x 100% | 6.0 MiB/s | 142.0 KiB | 00m00s [148/205] libkadm5-0:1.21.3-10.fc44.x86 100% | 3.7 MiB/s | 76.1 KiB | 00m00s [149/205] libidn2-devel-0:2.3.8-2.fc43. 100% | 2.8 MiB/s | 64.0 KiB | 00m00s [150/205] libnghttp2-devel-0:1.68.0-2.f 100% | 2.4 MiB/s | 54.5 KiB | 00m00s [151/205] libnghttp3-devel-0:1.13.1-1.f 100% | 1.1 MiB/s | 25.8 KiB | 00m00s [152/205] libpsl-devel-0:0.21.5-6.fc43. 100% | 1.5 MiB/s | 33.0 KiB | 00m00s [153/205] publicsuffix-list-0:20250616- 100% | 4.2 MiB/s | 89.9 KiB | 00m00s [154/205] libssh-devel-0:0.11.3-1.fc44. 100% | 1.9 MiB/s | 41.8 KiB | 00m00s [155/205] ngtcp2-crypto-ossl-devel-0:1. 100% | 539.3 KiB/s | 10.8 KiB | 00m00s [156/205] ngtcp2-devel-0:1.18.0-1.fc44. 100% | 2.7 MiB/s | 55.5 KiB | 00m00s [157/205] zlib-ng-compat-devel-0:2.3.2- 100% | 1.3 MiB/s | 38.2 KiB | 00m00s [158/205] keyutils-libs-devel-0:1.6.3-6 100% | 2.8 MiB/s | 59.8 KiB | 00m00s [159/205] openssl-devel-1:3.5.4-1.fc44. 100% | 46.3 MiB/s | 3.0 MiB | 00m00s [160/205] libcom_err-devel-0:1.47.3-3.f 100% | 834.5 KiB/s | 16.7 KiB | 00m00s [161/205] libselinux-devel-0:3.9-5.fc44 100% | 6.8 MiB/s | 152.1 KiB | 00m00s [162/205] libsepol-devel-0:3.9-2.fc43.x 100% | 1.9 MiB/s | 48.4 KiB | 00m00s [163/205] libverto-devel-0:0.3.2-11.fc4 100% | 713.6 KiB/s | 14.3 KiB | 00m00s [164/205] procps-ng-0:4.0.4-9.fc44.x86_ 100% | 14.2 MiB/s | 364.5 KiB | 00m00s [165/205] ucx-0:1.19.0-1.fc44.x86_64 100% | 2.4 MiB/s | 869.9 KiB | 00m00s [166/205] tcl-1:9.0.2-1.fc44.x86_64 100% | 44.0 MiB/s | 1.2 MiB | 00m00s [167/205] libtommath-0:1.3.1~rc1-6.fc43 100% | 2.6 MiB/s | 64.3 KiB | 00m00s [168/205] libdrm-0:2.4.128-3.fc44.x86_6 100% | 7.2 MiB/s | 162.0 KiB | 00m00s [169/205] libpciaccess-0:0.16-16.fc43.x 100% | 1.3 MiB/s | 26.2 KiB | 00m00s [170/205] hwdata-0:0.402-1.fc44.noarch 100% | 47.8 MiB/s | 1.7 MiB | 00m00s [171/205] hipcc-0:20-9.rocm7.1.1.fc44.x 100% | 2.5 MiB/s | 132.7 KiB | 00m00s [172/205] rocm-runtime-0:7.1.1-3.fc44.x 100% | 4.8 MiB/s | 642.3 KiB | 00m00s [173/205] libomp-devel-0:21.1.7-1.fc44. 100% | 11.9 MiB/s | 268.5 KiB | 00m00s [174/205] clang-resource-filesystem-0:2 100% | 1.1 MiB/s | 23.4 KiB | 00m00s [175/205] libomp-0:21.1.7-1.fc44.x86_64 100% | 28.1 MiB/s | 833.9 KiB | 00m00s [176/205] llvm-filesystem-0:21.1.7-1.fc 100% | 325.6 KiB/s | 17.6 KiB | 00m00s [177/205] rocm-device-libs-0:20-9.rocm7 100% | 36.9 MiB/s | 491.3 KiB | 00m00s [178/205] hipblas-0:7.1.0-4.fc44.x86_64 100% | 11.7 MiB/s | 119.8 KiB | 00m00s [179/205] llvm-libs-0:21.1.7-1.fc44.x86 100% | 133.1 MiB/s | 34.9 MiB | 00m00s [180/205] hipblas-common-devel-0:7.1.0- 100% | 81.8 KiB/s | 13.9 KiB | 00m00s [181/205] glibc-devel-0:2.42.9000-16.fc 100% | 24.6 MiB/s | 603.5 KiB | 00m00s [182/205] libxcrypt-devel-0:4.5.2-2.fc4 100% | 1.5 MiB/s | 30.1 KiB | 00m00s [183/205] pcre2-devel-0:10.47-1.fc44.x8 100% | 20.7 MiB/s | 550.9 KiB | 00m00s [184/205] pcre2-utf16-0:10.47-1.fc44.x8 100% | 10.9 MiB/s | 246.1 KiB | 00m00s [185/205] pcre2-utf32-0:10.47-1.fc44.x8 100% | 9.5 MiB/s | 232.9 KiB | 00m00s [186/205] kernel-headers-0:6.19.0-0.rc1 100% | 54.1 MiB/s | 1.7 MiB | 00m00s [187/205] rocm-hip-0:7.1.1-1.fc44.x86_6 100% | 17.8 MiB/s | 10.2 MiB | 00m01s [188/205] rocm-clang-devel-0:20-10.rocm 100% | 13.9 MiB/s | 2.5 MiB | 00m00s [189/205] rocm-clang-0:20-10.rocm7.1.1. 100% | 58.2 MiB/s | 15.9 MiB | 00m00s [190/205] rocm-clang-runtime-devel-0:20 100% | 4.8 MiB/s | 637.3 KiB | 00m00s [191/205] rocm-clang-libs-0:20-10.rocm7 100% | 33.4 MiB/s | 23.1 MiB | 00m01s [192/205] rocm-libc++-devel-0:20-10.roc 100% | 2.2 MiB/s | 1.2 MiB | 00m01s [193/205] rocm-libc++-0:20-10.rocm7.1.1 100% | 3.1 MiB/s | 372.9 KiB | 00m00s [194/205] rocm-llvm-filesystem-0:20-10. 100% | 308.0 KiB/s | 25.3 KiB | 00m00s [195/205] rocm-llvm-libs-0:20-10.rocm7. 100% | 39.7 MiB/s | 21.2 MiB | 00m01s [196/205] rocm-lld-0:20-10.rocm7.1.1.fc 100% | 6.1 MiB/s | 1.6 MiB | 00m00s [197/205] rocm-llvm-devel-0:20-10.rocm7 100% | 15.0 MiB/s | 4.0 MiB | 00m00s [198/205] rocm-llvm-0:20-10.rocm7.1.1.f 100% | 83.3 MiB/s | 13.5 MiB | 00m00s [199/205] python3-0:3.14.2-1.fc44.x86_6 100% | 1.4 MiB/s | 27.8 KiB | 00m00s [200/205] gcc-plugin-annobin-0:16.0.0-0 100% | 1.2 MiB/s | 33.1 KiB | 00m00s [201/205] annobin-plugin-gcc-0:13.03-1. 100% | 26.7 MiB/s | 682.8 KiB | 00m00s [202/205] annobin-docs-0:13.03-1.fc44.n 100% | 4.2 MiB/s | 89.4 KiB | 00m00s [203/205] cmake-rpm-macros-0:3.31.10-3. 100% | 681.6 KiB/s | 13.6 KiB | 00m00s [204/205] rocsolver-0:7.1.0-3.fc44.x86_ 100% | 251.3 MiB/s | 829.8 MiB | 00m03s [205/205] rocm-llvm-static-0:20-10.rocm 100% | 42.0 MiB/s | 282.0 MiB | 00m07s -------------------------------------------------------------------------------- [205/205] Total 100% | 134.4 MiB/s | 1.7 GiB | 00m13s Running transaction [ 1/207] Verify package files 100% | 28.0 B/s | 205.0 B | 00m07s [ 2/207] Prepare transaction 100% | 1.2 KiB/s | 205.0 B | 00m00s [ 3/207] Installing cmake-filesystem-0 100% | 3.7 MiB/s | 7.6 KiB | 00m00s [ 4/207] Installing fonts-filesystem-1 100% | 0.0 B/s | 788.0 B | 00m00s [ 5/207] Installing numactl-libs-0:2.0 100% | 56.4 MiB/s | 57.8 KiB | 00m00s [ 6/207] Installing hwloc-libs-0:2.12. 100% | 411.8 MiB/s | 2.9 MiB | 00m00s [ 7/207] Installing google-noto-fonts- 100% | 0.0 B/s | 18.5 KiB | 00m00s [ 8/207] Installing libnl3-0:3.12.0-2. 100% | 261.8 MiB/s | 1.0 MiB | 00m00s [ 9/207] Installing less-0:685-6.fc44. 100% | 31.5 MiB/s | 452.2 KiB | 00m00s [ 10/207] Installing libmpc-0:1.3.1-8.f 100% | 158.3 MiB/s | 162.1 KiB | 00m00s [ 11/207] Installing expat-0:2.7.3-1.fc 100% | 22.8 MiB/s | 303.2 KiB | 00m00s [ 12/207] Installing libpsm2-0:12.0.1-3 100% | 216.5 MiB/s | 443.4 KiB | 00m00s [ 13/207] Installing zlib-ng-compat-dev 100% | 106.0 MiB/s | 108.6 KiB | 00m00s [ 14/207] Installing rocm-llvm-filesyst 100% | 6.2 MiB/s | 19.1 KiB | 00m00s [ 15/207] Installing rocm-libc++-0:20-1 100% | 47.7 MiB/s | 1.3 MiB | 00m00s [ 16/207] Installing rocm-llvm-libs-0:2 100% | 72.5 MiB/s | 91.6 MiB | 00m01s [ 17/207] Installing rocm-clang-libs-0: 100% | 71.0 MiB/s | 94.1 MiB | 00m01s [ 18/207] Installing openssl-devel-1:3. 100% | 64.3 MiB/s | 5.6 MiB | 00m00s [ 19/207] Installing ngtcp2-devel-0:1.1 100% | 291.1 MiB/s | 298.1 KiB | 00m00s [ 20/207] Installing gpgme-0:1.24.3-6.f 100% | 26.2 MiB/s | 590.4 KiB | 00m00s [ 21/207] Installing groff-base-0:1.23. 100% | 113.1 MiB/s | 3.8 MiB | 00m00s [ 22/207] Installing rdma-core-common-0 100% | 22.6 MiB/s | 23.2 KiB | 00m00s [ 23/207] Installing libibverbs-0:60.0- 100% | 196.9 MiB/s | 1.2 MiB | 00m00s [ 24/207] Installing vim-filesystem-2:9 100% | 4.6 MiB/s | 4.7 KiB | 00m00s [ 25/207] Installing emacs-filesystem-1 100% | 0.0 B/s | 812.0 B | 00m00s [ 26/207] Installing libedit-0:3.1-57.2 100% | 236.2 MiB/s | 241.8 KiB | 00m00s [ 27/207] Installing rocm-comgr-0:20-10 100% | 69.8 MiB/s | 126.3 MiB | 00m02s [ 28/207] Installing make-1:4.4.1-11.fc 100% | 85.7 MiB/s | 1.8 MiB | 00m00s [ 29/207] Installing orangefs-0:2.10.1- 100% | 86.1 MiB/s | 1.7 MiB | 00m00s [ 30/207] Installing librdmacm-0:60.0-1 100% | 148.5 MiB/s | 152.1 KiB | 00m00s [ 31/207] Installing libfabric-0:2.3.1- 100% | 281.9 MiB/s | 9.0 MiB | 00m00s [ 32/207] Installing ngtcp2-crypto-ossl 100% | 0.0 B/s | 8.1 KiB | 00m00s [ 33/207] Installing rocm-lld-0:20-10.r 100% | 64.7 MiB/s | 5.9 MiB | 00m00s [ 34/207] Installing rocm-libc++-devel- 100% | 110.5 MiB/s | 15.4 MiB | 00m00s [ 35/207] Installing cpp-0:16.0.0-0.2.f 100% | 331.0 MiB/s | 43.7 MiB | 00m00s [ 36/207] Installing google-noto-sans-m 100% | 274.5 MiB/s | 562.2 KiB | 00m00s [ 37/207] Installing google-noto-serif- 100% | 318.0 MiB/s | 1.6 MiB | 00m00s [ 38/207] Installing google-noto-sans-v 100% | 347.8 MiB/s | 1.4 MiB | 00m00s [ 39/207] Installing abattis-cantarell- 100% | 189.9 MiB/s | 194.4 KiB | 00m00s [ 40/207] Installing default-fonts-core 100% | 17.8 MiB/s | 18.2 KiB | 00m00s [ 41/207] Installing langpacks-core-en- 100% | 0.0 B/s | 704.0 B | 00m00s [ 42/207] Installing langpacks-fonts-en 100% | 0.0 B/s | 652.0 B | 00m00s [ 43/207] Installing libssh-devel-0:0.1 100% | 176.3 MiB/s | 180.6 KiB | 00m00s [ 44/207] Installing hipblas-common-dev 100% | 0.0 B/s | 18.2 KiB | 00m00s [ 45/207] Installing annobin-docs-0:13. 100% | 98.0 MiB/s | 100.3 KiB | 00m00s [ 46/207] Installing rocm-clang-runtime 100% | 128.5 MiB/s | 8.5 MiB | 00m00s [ 47/207] Installing kernel-headers-0:6 100% | 200.8 MiB/s | 7.0 MiB | 00m00s [ 48/207] Installing glibc-devel-0:2.42 100% | 169.7 MiB/s | 2.4 MiB | 00m00s [ 49/207] Installing libxcrypt-devel-0: 100% | 32.5 MiB/s | 33.3 KiB | 00m00s [ 50/207] Installing gcc-0:16.0.0-0.2.f 100% | 387.8 MiB/s | 126.0 MiB | 00m00s [ 51/207] Installing pcre2-utf32-0:10.4 100% | 298.8 MiB/s | 611.9 KiB | 00m00s [ 52/207] Installing pcre2-utf16-0:10.4 100% | 208.3 MiB/s | 640.0 KiB | 00m00s [ 53/207] Installing pcre2-devel-0:10.4 100% | 106.0 MiB/s | 2.1 MiB | 00m00s [ 54/207] Installing llvm-filesystem-0: 100% | 1.0 MiB/s | 1.1 KiB | 00m00s [ 55/207] Installing llvm-libs-0:21.1.7 100% | 430.5 MiB/s | 138.6 MiB | 00m00s [ 56/207] Installing libomp-0:21.1.7-1. 100% | 354.1 MiB/s | 2.8 MiB | 00m00s [ 57/207] Installing clang-resource-fil 100% | 16.3 MiB/s | 16.7 KiB | 00m00s [ 58/207] Installing libomp-devel-0:21. 100% | 485.7 MiB/s | 1.5 MiB | 00m00s [ 59/207] Installing hwdata-0:0.402-1.f 100% | 512.0 MiB/s | 9.7 MiB | 00m00s [ 60/207] Installing libpciaccess-0:0.1 100% | 44.8 MiB/s | 45.9 KiB | 00m00s [ 61/207] Installing libdrm-0:2.4.128-3 100% | 197.1 MiB/s | 403.7 KiB | 00m00s [ 62/207] Installing rocm-runtime-0:7.1 100% | 460.6 MiB/s | 3.2 MiB | 00m00s [ 63/207] Installing rocm-runtime-devel 100% | 335.7 MiB/s | 687.6 KiB | 00m00s [ 64/207] Installing libtommath-0:1.3.1 100% | 124.5 MiB/s | 127.5 KiB | 00m00s [ 65/207] Installing tcl-1:9.0.2-1.fc44 100% | 160.7 MiB/s | 4.3 MiB | 00m00s [ 66/207] Installing procps-ng-0:4.0.4- 100% | 59.4 MiB/s | 1.0 MiB | 00m00s [ 67/207] Installing libverto-devel-0:0 100% | 0.0 B/s | 26.4 KiB | 00m00s [ 68/207] Installing libsepol-devel-0:3 100% | 62.9 MiB/s | 128.9 KiB | 00m00s [ 69/207] Installing libselinux-devel-0 100% | 39.6 MiB/s | 162.1 KiB | 00m00s [ 70/207] Installing libcom_err-devel-0 100% | 1.4 MiB/s | 18.3 KiB | 00m00s [ 71/207] Installing keyutils-libs-deve 100% | 53.9 MiB/s | 55.2 KiB | 00m00s [ 72/207] Installing publicsuffix-list- 100% | 326.0 MiB/s | 333.8 KiB | 00m00s [ 73/207] Installing libpsl-devel-0:0.2 100% | 110.9 MiB/s | 113.5 KiB | 00m00s [ 74/207] Installing libnghttp3-devel-0 100% | 0.0 B/s | 105.8 KiB | 00m00s [ 75/207] Installing libnghttp2-devel-0 100% | 282.3 MiB/s | 289.1 KiB | 00m00s [ 76/207] Installing libidn2-devel-0:2. 100% | 76.5 MiB/s | 156.7 KiB | 00m00s [ 77/207] Installing libkadm5-0:1.21.3- 100% | 210.9 MiB/s | 215.9 KiB | 00m00s [ 78/207] Installing krb5-devel-0:1.21. 100% | 46.6 MiB/s | 715.2 KiB | 00m00s [ 79/207] Installing brotli-0:1.2.0-1.f 100% | 2.6 MiB/s | 34.4 KiB | 00m00s [ 80/207] Installing brotli-devel-0:1.2 100% | 66.8 MiB/s | 68.4 KiB | 00m00s [ 81/207] Installing ucx-0:1.19.0-1.fc4 100% | 122.8 MiB/s | 2.5 MiB | 00m00s [ 82/207] Installing libquadmath-0:16.0 100% | 319.5 MiB/s | 327.2 KiB | 00m00s [ 83/207] Installing libgfortran-0:16.0 100% | 284.8 MiB/s | 3.4 MiB | 00m00s [ 84/207] Installing tzdata-0:2025c-1.f 100% | 45.8 MiB/s | 1.5 MiB | 00m00s [ 85/207] Installing python-pip-wheel-0 100% | 602.0 MiB/s | 1.2 MiB | 00m00s [ 86/207] Installing mpdecimal-0:4.0.1- 100% | 30.5 MiB/s | 218.8 KiB | 00m00s [ 87/207] Installing python3-libs-0:3.1 100% | 312.8 MiB/s | 43.5 MiB | 00m00s [ 88/207] Installing python3-0:3.14.2-1 100% | 2.1 MiB/s | 30.6 KiB | 00m00s [ 89/207] Installing cmake-rpm-macros-0 100% | 8.7 MiB/s | 8.9 KiB | 00m00s [ 90/207] Installing rocm-llvm-0:20-10. 100% | 67.4 MiB/s | 52.5 MiB | 00m01s [ 91/207] Installing rocm-llvm-devel-0: 100% | 92.1 MiB/s | 28.7 MiB | 00m00s [ 92/207] Installing rocm-llvm-static-0 100% | 92.1 MiB/s | 1.9 GiB | 00m21s [ 93/207] Installing protobuf-c-0:1.5.1 100% | 50.2 MiB/s | 51.4 KiB | 00m00s [ 94/207] Installing hiredis-0:1.2.0-7. 100% | 105.1 MiB/s | 107.6 KiB | 00m00s [ 95/207] Installing dns-root-data-0:20 100% | 1.9 MiB/s | 13.8 KiB | 00m00s >>> Running sysusers scriptlet: unbound-libs-0:1.24.2-1.fc44.x86_64 >>> Finished sysusers scriptlet: unbound-libs-0:1.24.2-1.fc44.x86_64 >>> Scriptlet output: >>> Creating group 'unbound' with GID 999. >>> Creating user 'unbound' (Unbound DNS resolver) with UID 999 and GID 999. >>> [ 96/207] Installing unbound-libs-0:1.2 100% | 295.1 MiB/s | 1.5 MiB | 00m00s [ 97/207] Installing gnutls-dane-0:3.8. 100% | 60.3 MiB/s | 61.7 KiB | 00m00s [ 98/207] Installing wget2-libs-0:2.2.0 100% | 179.1 MiB/s | 366.8 KiB | 00m00s [ 99/207] Installing wget2-0:2.2.0-6.fc 100% | 58.6 MiB/s | 1.1 MiB | 00m00s [100/207] Installing ncurses-0:6.5-8.20 100% | 40.1 MiB/s | 616.4 KiB | 00m00s [101/207] Installing perl-Digest-0:1.20 100% | 36.2 MiB/s | 37.1 KiB | 00m00s [102/207] Installing perl-Digest-MD5-0: 100% | 60.1 MiB/s | 61.6 KiB | 00m00s [103/207] Installing perl-B-0:1.89-520. 100% | 246.4 MiB/s | 504.7 KiB | 00m00s [104/207] Installing perl-FileHandle-0: 100% | 0.0 B/s | 9.8 KiB | 00m00s [105/207] Installing perl-libnet-0:3.15 100% | 143.9 MiB/s | 294.7 KiB | 00m00s [106/207] Installing perl-Data-Dumper-0 100% | 114.8 MiB/s | 117.5 KiB | 00m00s [107/207] Installing perl-MIME-Base32-0 100% | 31.4 MiB/s | 32.2 KiB | 00m00s [108/207] Installing perl-AutoLoader-0: 100% | 0.0 B/s | 21.0 KiB | 00m00s [109/207] Installing perl-URI-0:5.34-2. 100% | 91.7 MiB/s | 281.8 KiB | 00m00s [110/207] Installing perl-IO-Socket-IP- 100% | 99.8 MiB/s | 102.2 KiB | 00m00s [111/207] Installing perl-Net-SSLeay-0: 100% | 271.7 MiB/s | 1.4 MiB | 00m00s [112/207] Installing perl-IO-Socket-SSL 100% | 350.9 MiB/s | 718.6 KiB | 00m00s [113/207] Installing perl-Text-Tabs+Wra 100% | 0.0 B/s | 23.9 KiB | 00m00s [114/207] Installing perl-Pod-Escapes-1 100% | 0.0 B/s | 25.9 KiB | 00m00s [115/207] Installing perl-File-Path-0:2 100% | 0.0 B/s | 64.5 KiB | 00m00s [116/207] Installing perl-if-0:0.61.000 100% | 0.0 B/s | 6.2 KiB | 00m00s [117/207] Installing perl-Time-Local-2: 100% | 68.9 MiB/s | 70.6 KiB | 00m00s [118/207] Installing perl-locale-0:1.13 100% | 0.0 B/s | 6.5 KiB | 00m00s [119/207] Installing perl-Pod-Simple-1: 100% | 280.7 MiB/s | 574.9 KiB | 00m00s [120/207] Installing perl-HTTP-Tiny-0:0 100% | 152.8 MiB/s | 156.4 KiB | 00m00s [121/207] Installing perl-File-Temp-1:0 100% | 161.6 MiB/s | 165.5 KiB | 00m00s [122/207] Installing perl-Class-Struct- 100% | 0.0 B/s | 25.9 KiB | 00m00s [123/207] Installing perl-IPC-Open3-0:1 100% | 0.0 B/s | 28.5 KiB | 00m00s [124/207] Installing perl-Term-Cap-0:1. 100% | 0.0 B/s | 30.6 KiB | 00m00s [125/207] Installing perl-Term-ANSIColo 100% | 96.9 MiB/s | 99.2 KiB | 00m00s [126/207] Installing perl-POSIX-0:2.23- 100% | 227.2 MiB/s | 232.6 KiB | 00m00s [127/207] Installing perl-podlators-1:6 100% | 22.4 MiB/s | 321.4 KiB | 00m00s [128/207] Installing perl-Pod-Perldoc-0 100% | 12.7 MiB/s | 169.2 KiB | 00m00s [129/207] Installing perl-File-stat-0:1 100% | 0.0 B/s | 13.1 KiB | 00m00s [130/207] Installing perl-Socket-4:2.04 100% | 119.4 MiB/s | 122.3 KiB | 00m00s [131/207] Installing perl-SelectSaver-0 100% | 0.0 B/s | 2.6 KiB | 00m00s [132/207] Installing perl-Symbol-0:1.09 100% | 0.0 B/s | 7.3 KiB | 00m00s [133/207] Installing perl-Pod-Usage-4:2 100% | 7.2 MiB/s | 87.9 KiB | 00m00s [134/207] Installing perl-IO-0:1.55-520 100% | 148.1 MiB/s | 151.7 KiB | 00m00s [135/207] Installing perl-overloading-0 100% | 0.0 B/s | 5.6 KiB | 00m00s [136/207] Installing perl-mro-0:1.29-52 100% | 0.0 B/s | 42.7 KiB | 00m00s [137/207] Installing perl-Fcntl-0:1.20- 100% | 0.0 B/s | 49.9 KiB | 00m00s [138/207] Installing perl-base-0:2.27-5 100% | 0.0 B/s | 13.0 KiB | 00m00s [139/207] Installing perl-Text-ParseWor 100% | 0.0 B/s | 14.6 KiB | 00m00s [140/207] Installing perl-File-Basename 100% | 0.0 B/s | 14.6 KiB | 00m00s [141/207] Installing perl-Getopt-Long-1 100% | 143.8 MiB/s | 147.2 KiB | 00m00s [142/207] Installing perl-Storable-1:3. 100% | 227.4 MiB/s | 232.8 KiB | 00m00s [143/207] Installing perl-overload-0:1. 100% | 0.0 B/s | 72.0 KiB | 00m00s [144/207] Installing perl-parent-1:0.24 100% | 0.0 B/s | 11.0 KiB | 00m00s [145/207] Installing perl-MIME-Base64-0 100% | 43.2 MiB/s | 44.3 KiB | 00m00s [146/207] Installing perl-vars-0:1.05-5 100% | 0.0 B/s | 4.3 KiB | 00m00s [147/207] Installing perl-Errno-0:1.38- 100% | 1.7 MiB/s | 8.8 KiB | 00m00s [148/207] Installing perl-constant-0:1. 100% | 0.0 B/s | 27.4 KiB | 00m00s [149/207] Installing perl-Scalar-List-U 100% | 145.2 MiB/s | 148.7 KiB | 00m00s [150/207] Installing perl-Getopt-Std-0: 100% | 0.0 B/s | 11.8 KiB | 00m00s [151/207] Installing perl-Encode-4:3.21 100% | 187.8 MiB/s | 4.7 MiB | 00m00s [152/207] Installing perl-DynaLoader-0: 100% | 0.0 B/s | 32.5 KiB | 00m00s [153/207] Installing perl-PathTools-0:3 100% | 180.2 MiB/s | 184.6 KiB | 00m00s [154/207] Installing perl-Exporter-0:5. 100% | 0.0 B/s | 55.6 KiB | 00m00s [155/207] Installing perl-Carp-0:1.54-5 100% | 23.3 MiB/s | 47.7 KiB | 00m00s [156/207] Installing perl-libs-4:5.42.0 100% | 277.3 MiB/s | 11.6 MiB | 00m00s [157/207] Installing perl-interpreter-4 100% | 9.0 MiB/s | 120.3 KiB | 00m00s [158/207] Installing perl-TermReadKey-0 100% | 64.6 MiB/s | 66.2 KiB | 00m00s [159/207] Installing perl-lib-0:0.65-52 100% | 0.0 B/s | 8.9 KiB | 00m00s [160/207] Installing perl-File-Copy-0:2 100% | 0.0 B/s | 20.2 KiB | 00m00s [161/207] Installing perl-File-Which-0: 100% | 0.0 B/s | 31.4 KiB | 00m00s [162/207] Installing perl-Error-1:0.170 100% | 78.1 MiB/s | 80.0 KiB | 00m00s [163/207] Installing libpipeline-0:1.5. 100% | 14.3 MiB/s | 146.6 KiB | 00m00s [164/207] Installing man-db-0:2.13.1-2. 100% | 85.7 MiB/s | 2.9 MiB | 00m00s [165/207] Installing environment-module 100% | 67.6 MiB/s | 1.9 MiB | 00m00s [166/207] Installing libcbor-0:0.13.0-1 100% | 78.9 MiB/s | 80.8 KiB | 00m00s [167/207] Installing libfido2-0:1.16.0- 100% | 234.4 MiB/s | 240.1 KiB | 00m00s [168/207] Installing munge-libs-0:0.5.1 100% | 0.0 B/s | 28.8 KiB | 00m00s [169/207] Installing pmix-0:5.0.7-2.fc4 100% | 314.8 MiB/s | 2.2 MiB | 00m00s [170/207] Installing prrte-libs-0:3.0.6 100% | 278.4 MiB/s | 1.7 MiB | 00m00s [171/207] Installing prrte-0:3.0.6-8.fc 100% | 158.4 MiB/s | 162.2 KiB | 00m00s [172/207] Installing openssh-0:10.2p1-1 100% | 88.8 MiB/s | 1.4 MiB | 00m00s [173/207] Installing openssh-clients-0: 100% | 108.7 MiB/s | 2.6 MiB | 00m00s [174/207] Installing git-core-0:2.52.0- 100% | 333.7 MiB/s | 24.0 MiB | 00m00s [175/207] Installing git-core-doc-0:2.5 100% | 357.1 MiB/s | 18.6 MiB | 00m00s [176/207] Installing git-0:2.52.0-1.fc4 100% | 56.4 MiB/s | 57.7 KiB | 00m00s [177/207] Installing perl-Git-0:2.52.0- 100% | 63.8 MiB/s | 65.4 KiB | 00m00s [178/207] Installing rocm-clang-0:20-10 100% | 74.1 MiB/s | 68.5 MiB | 00m01s [179/207] Installing rocm-clang-devel-0 100% | 122.1 MiB/s | 26.3 MiB | 00m00s [180/207] Installing rocm-device-libs-0 100% | 90.6 MiB/s | 3.3 MiB | 00m00s [181/207] Installing hipcc-0:20-9.rocm7 100% | 29.6 MiB/s | 635.9 KiB | 00m00s [182/207] Installing rocm-hip-0:7.1.1-1 100% | 325.3 MiB/s | 27.0 MiB | 00m00s [183/207] Installing rocblas-0:7.1.1-3. 100% | 83.9 MiB/s | 973.9 MiB | 00m12s [184/207] Installing rocsolver-0:7.1.0- 100% | 31.5 MiB/s | 936.6 MiB | 00m30s [185/207] Installing hipblas-0:7.1.0-4. 100% | 22.5 MiB/s | 805.0 KiB | 00m00s [186/207] Installing rocm-comgr-devel-0 100% | 1.4 MiB/s | 101.9 KiB | 00m00s [187/207] Installing rocm-hip-devel-0:7 100% | 21.2 MiB/s | 2.4 MiB | 00m00s [188/207] Installing pthreadpool-0:0.0^ 100% | 3.2 MiB/s | 114.5 KiB | 00m00s [189/207] Installing libstdc++-devel-0: 100% | 151.7 MiB/s | 38.8 MiB | 00m00s [190/207] Installing rhash-0:1.4.5-3.fc 100% | 14.5 MiB/s | 356.4 KiB | 00m00s [191/207] Installing libuv-1:1.51.0-2.f 100% | 93.3 MiB/s | 573.0 KiB | 00m00s [192/207] Installing jsoncpp-0:1.9.6-2. 100% | 42.2 MiB/s | 259.2 KiB | 00m00s [193/207] Installing cmake-0:3.31.10-3. 100% | 297.9 MiB/s | 34.6 MiB | 00m00s [194/207] Installing cmake-data-0:3.31. 100% | 100.8 MiB/s | 9.0 MiB | 00m00s [195/207] Installing gcc-c++-0:16.0.0-0 100% | 86.8 MiB/s | 48.4 MiB | 00m01s [196/207] Installing pthreadpool-devel- 100% | 32.5 MiB/s | 99.8 KiB | 00m00s [197/207] Installing rocblas-devel-0:7. 100% | 143.3 MiB/s | 2.7 MiB | 00m00s [198/207] Installing hipblas-devel-0:7. 100% | 160.2 MiB/s | 2.4 MiB | 00m00s [199/207] Installing hipcc-libomp-devel 100% | 30.3 KiB/s | 124.0 B | 00m00s [200/207] Installing openmpi-0:5.0.9-1. 100% | 305.4 MiB/s | 7.0 MiB | 00m00s [201/207] Installing rocm-rpm-macros-0: 100% | 3.8 MiB/s | 19.5 KiB | 00m00s [202/207] Installing wget2-wget-0:2.2.0 100% | 20.6 KiB/s | 444.0 B | 00m00s [203/207] Installing libcurl-devel-0:8. 100% | 50.4 MiB/s | 1.5 MiB | 00m00s [204/207] Installing gcc-plugin-annobin 100% | 2.6 MiB/s | 58.6 KiB | 00m00s [205/207] Installing annobin-plugin-gcc 100% | 37.8 MiB/s | 697.4 KiB | 00m00s [206/207] Installing langpacks-en-0:4.2 100% | 170.9 KiB/s | 700.0 B | 00m00s [207/207] Installing xxd-2:9.1.1972-1.f 100% | 48.4 KiB/s | 34.2 KiB | 00m01s Warning: skipped OpenPGP checks for 35 packages from repository: copr_base Complete! Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1766188800 Wrote: /builddir/build/SRPMS/llama-cpp-b6153-1.fc44.src.rpm Updating and loading repositories: Copr repository 100% | 109.3 KiB/s | 1.5 KiB | 00m00s fedora 100% | 204.0 KiB/s | 26.5 KiB | 00m00s Repositories loaded. Nothing to do. Package "cmake-3.31.10-3.fc44.x86_64" is already installed. Package "curl-8.18.0~rc2-1.fc44.x86_64" is already installed. Package "gcc-c++-16.0.0-0.2.fc44.x86_64" is already installed. Package "git-2.52.0-1.fc44.x86_64" is already installed. Package "hipblas-devel-7.1.0-4.fc44.x86_64" is already installed. Package "hipcc-libomp-devel-20-9.rocm7.1.1.fc44.x86_64" is already installed. Package "langpacks-en-4.2-5.fc43.noarch" is already installed. Package "libcurl-devel-8.18.0~rc2-1.fc44.x86_64" is already installed. Package "openmpi-5.0.9-1.fc44.x86_64" is already installed. Package "pthreadpool-devel-0.0^git20230829.4fe0e1e-8.fc44.x86_64" is already installed. Package "rocblas-devel-7.1.1-3.fc44.x86_64" is already installed. Package "rocm-comgr-devel-20-10.rocm7.1.1.fc44.x86_64" is already installed. Package "rocm-hip-devel-7.1.1-1.fc44.x86_64" is already installed. Package "rocm-rpm-macros-7.1.0-7.fc44.noarch" is already installed. Package "rocm-runtime-devel-7.1.1-3.fc44.x86_64" is already installed. Package "wget2-wget-2.2.0-6.fc43.x86_64" is already installed. Package "xxd-2:9.1.1972-1.fc44.x86_64" is already installed. Finish: build setup for llama-cpp-b6153-1.fc44.src.rpm Start: rpmbuild llama-cpp-b6153-1.fc44.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1766188800 Executing(%mkbuilddir): /bin/sh -e /var/tmp/rpm-tmp.7ISIQf Executing(%prep): /bin/sh -e /var/tmp/rpm-tmp.GdXFAv + umask 022 + cd /builddir/build/BUILD/llama-cpp-b6153-build + cd /builddir/build/BUILD/llama-cpp-b6153-build + rm -rf llama.cpp-b6153 + /usr/lib/rpm/rpmuncompress -x /builddir/build/SOURCES/llama.cpp-b6153.tar.gz + STATUS=0 + '[' 0 -ne 0 ']' + cd llama.cpp-b6153 + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w . + sed -i -e 's/POSITION_INDEPENDENT_CODE ON/POSITION_INDEPENDENT_CODE ON SOVERSION b6153/' src/CMakeLists.txt + sed -i -e 's/POSITION_INDEPENDENT_CODE ON/POSITION_INDEPENDENT_CODE ON SOVERSION b6153/' ggml/src/CMakeLists.txt + sed -i -e 's/POSITION_INDEPENDENT_CODE ON/POSITION_INDEPENDENT_CODE ON SOVERSION b6153/' tools/mtmd/CMakeLists.txt + sed -i '/target_link_libraries(ggml-hip PRIVATE ggml-base.*/aset_target_properties(ggml-hip PROPERTIES SOVERSION b6153)' ggml/src/ggml-hip/CMakeLists.txt + sed -i '/target_compile_features(${GGML_CPU_NAME} PRIVATE c_std_11.*/aset_target_properties(${GGML_CPU_NAME} PROPERTIES SOVERSION b6153)' ggml/src/ggml-cpu/CMakeLists.txt + sed -i '/#include ' src/llama-mmap.h + rm -rf exmples/llma.android + find . -name .gitignore -exec rm -rf '{}' ';' + RPM_EC=0 ++ jobs -p + exit 0 Executing(%build): /bin/sh -e /var/tmp/rpm-tmp.5mGn65 + umask 022 + cd /builddir/build/BUILD/llama-cpp-b6153-build + CFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd llama.cpp-b6153 + CFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + /usr/bin/cmake -S . -B redhat-linux-build -DCMAKE_C_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_CXX_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_Fortran_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_VERBOSE_MAKEFILE:BOOL=ON -DCMAKE_INSTALL_DO_STRIP:BOOL=OFF -DCMAKE_INSTALL_PREFIX:PATH=/usr -DCMAKE_INSTALL_FULL_SBINDIR:PATH=/usr/bin -DCMAKE_INSTALL_SBINDIR:PATH=bin -DINCLUDE_INSTALL_DIR:PATH=/usr/include -DLIB_INSTALL_DIR:PATH=/usr/lib64 -DSYSCONF_INSTALL_DIR:PATH=/etc -DSHARE_INSTALL_PREFIX:PATH=/usr/share -DLIB_SUFFIX=64 -DBUILD_SHARED_LIBS:BOOL=ON -DCMAKE_INSTALL_LIBDIR=lib64 -DCMAKE_SKIP_RPATH=ON -DGGML_AVX=OFF -DGGML_AVX2=OFF -DGGML_AVX512=OFF -DGGML_AVX512_VBMI=OFF -DGGML_AVX512_VNNI=OFF -DGGML_FMA=OFF -DGGML_F16C=OFF -DGGML_HIP=ON '-DAMDGPU_TARGETS=gfx900;gfx906:xnack-;gfx908:xnack-;gfx90a:xnack+;gfx90a:xnack-;gfx942;gfx950;gfx1010;gfx1012;gfx1030;gfx1031;gfx1035;gfx1036;gfx1100;gfx1101;gfx1102;gfx1103;gfx1150;gfx1151;gfx1152;gfx1153;gfx1200;gfx1201' -DLLAMA_BUILD_EXAMPLES=OFF -DLLAMA_BUILD_TESTS=OFF -- The C compiler identification is Clang 20.0.0 -- The CXX compiler identification is Clang 20.0.0 -- Detecting C compiler ABI info -- Detecting C compiler ABI info - done -- Check for working C compiler: /usr/bin/hipcc - skipped -- Detecting C compile features -- Detecting C compile features - done -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/hipcc - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- Found Git: /usr/bin/git (found version "2.52.0") fatal: not a git repository (or any of the parent directories): .git fatal: not a git repository (or any of the parent directories): .git sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory -- Setting GGML_NATIVE_DEFAULT to OFF -- Performing Test CMAKE_HAVE_LIBC_PTHREAD -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success -- Found Threads: TRUE -- Warning: ccache not found - consider installing it for faster compilation or disable this warning with GGML_CCACHE=OFF -- CMAKE_SYSTEM_PROCESSOR: x86_64 -- GGML_SYSTEM_ARCH: x86 -- Including CPU backend -- Could NOT find OpenMP_C (missing: OpenMP_C_FLAGS OpenMP_C_LIB_NAMES) -- Could NOT find OpenMP_CXX (missing: OpenMP_CXX_FLAGS OpenMP_CXX_LIB_NAMES) -- Could NOT find OpenMP (missing: OpenMP_C_FOUND OpenMP_CXX_FOUND) -- x86 detected -- Adding CPU backend variant ggml-cpu: CMake Warning at ggml/src/ggml-cpu/CMakeLists.txt:80 (message): OpenMP not found Call Stack (most recent call first): ggml/src/CMakeLists.txt:372 (ggml_add_cpu_backend_variant_impl) CMake Warning at ggml/src/ggml-hip/CMakeLists.txt:27 (message): Setting hipcc as the C++ compiler is legacy behavior. Prefer setting the HIP compiler directly. See README for details. CMake Warning (dev) at /usr/lib64/cmake/hip/hip-config-amd.cmake:70 (message): AMDGPU_TARGETS is deprecated. Please use GPU_TARGETS instead. Call Stack (most recent call first): /usr/lib64/cmake/hip/hip-config.cmake:148 (include) ggml/src/ggml-hip/CMakeLists.txt:39 (find_package) This warning is for project developers. Use -Wno-dev to suppress it. -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success -- HIP and hipBLAS found -- Including HIP backend -- ggml version: 0.0.0 -- ggml commit: unknown CMake Warning at common/CMakeLists.txt:32 (message): Git repository not found; to enable automatic generation of build info, make sure Git is installed and the project is a Git repository. -- Found CURL: /usr/lib64/libcurl.so (found version "8.18.0-rc2") -- Configuring done (10.9s) -- Generating done (0.1s) CMake Warning: Manually-specified variables were not used by the project: CMAKE_Fortran_FLAGS_RELEASE CMAKE_INSTALL_DO_STRIP INCLUDE_INSTALL_DIR LIB_INSTALL_DIR LIB_SUFFIX SHARE_INSTALL_PREFIX SYSCONF_INSTALL_DIR -- Build files have been written to: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build + /usr/bin/cmake --build redhat-linux-build -j4 --verbose Change Dir: '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' Run Build Command(s): /usr/bin/cmake -E env VERBOSE=1 /usr/bin/gmake -f Makefile -j4 /usr/bin/cmake -S/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 -B/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build --check-build-system CMakeFiles/Makefile.cmake 0 /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/CMakeFiles /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build//CMakeFiles/progress.marks /usr/bin/gmake -f CMakeFiles/Makefile2 all gmake[1]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f ggml/src/CMakeFiles/ggml-base.dir/build.make ggml/src/CMakeFiles/ggml-base.dir/depend /usr/bin/gmake -f common/CMakeFiles/build_info.dir/build.make common/CMakeFiles/build_info.dir/depend /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-llava-cli.dir/build.make tools/mtmd/CMakeFiles/llama-llava-cli.dir/depend /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/build.make tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/CMakeFiles/ggml-base.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-llava-cli.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common/CMakeFiles/build_info.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f ggml/src/CMakeFiles/ggml-base.dir/build.make ggml/src/CMakeFiles/ggml-base.dir/build gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-llava-cli.dir/build.make tools/mtmd/CMakeFiles/llama-llava-cli.dir/build /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/build.make tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f common/CMakeFiles/build_info.dir/build.make common/CMakeFiles/build_info.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 1%] Building C object ggml/src/CMakeFiles/ggml-base.dir/ggml.c.o [ 1%] Building CXX object tools/mtmd/CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o -MF CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o.d -o CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/deprecation-warning.cpp [ 2%] Building CXX object tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml.c.o -MF CMakeFiles/ggml-base.dir/ggml.c.o.d -o CMakeFiles/ggml-base.dir/ggml.c.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml.c cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o -MF CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o.d -o CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/deprecation-warning.cpp [ 2%] Building CXX object common/CMakeFiles/build_info.dir/build-info.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/build_info.dir/build-info.cpp.o -MF CMakeFiles/build_info.dir/build-info.cpp.o.d -o CMakeFiles/build_info.dir/build-info.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common/build-info.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 3%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/ggml.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml.cpp.o -MF CMakeFiles/ggml-base.dir/ggml.cpp.o.d -o CMakeFiles/ggml-base.dir/ggml.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 3%] Linking CXX executable ../../bin/llama-llava-cli cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-llava-cli.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 3%] Built target build_info clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 3%] Building C object ggml/src/CMakeFiles/ggml-base.dir/ggml-alloc.c.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-alloc.c.o -MF CMakeFiles/ggml-base.dir/ggml-alloc.c.o.d -o CMakeFiles/ggml-base.dir/ggml-alloc.c.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-alloc.c sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 3%] Linking CXX executable ../../bin/llama-gemma3-cli cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-gemma3-cli.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-llava-cli.dir/link.d "CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o" -o ../../bin/llama-llava-cli gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-gemma3-cli.dir/link.d "CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o" -o ../../bin/llama-gemma3-cli gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 3%] Built target llama-llava-cli [ 3%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/ggml-backend.cpp.o [ 3%] Built target llama-gemma3-cli /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/build.make tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/depend cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-backend.cpp.o -MF CMakeFiles/ggml-base.dir/ggml-backend.cpp.o.d -o CMakeFiles/ggml-base.dir/ggml-backend.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-backend.cpp gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/build.make tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 4%] Building CXX object tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o -MF CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o.d -o CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/deprecation-warning.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/build.make tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/build.make tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 4%] Building CXX object tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o -MF CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o.d -o CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/deprecation-warning.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 5%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/ggml-opt.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-opt.cpp.o -MF CMakeFiles/ggml-base.dir/ggml-opt.cpp.o.d -o CMakeFiles/ggml-base.dir/ggml-opt.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-opt.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 5%] Linking CXX executable ../../bin/llama-minicpmv-cli cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-minicpmv-cli.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 6%] Linking CXX executable ../../bin/llama-qwen2vl-cli cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-qwen2vl-cli.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-minicpmv-cli.dir/link.d "CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o" -o ../../bin/llama-minicpmv-cli gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 6%] Built target llama-minicpmv-cli [ 6%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/ggml-threading.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-threading.cpp.o -MF CMakeFiles/ggml-base.dir/ggml-threading.cpp.o.d -o CMakeFiles/ggml-base.dir/ggml-threading.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-threading.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-qwen2vl-cli.dir/link.d "CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o" -o ../../bin/llama-qwen2vl-cli gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 6%] Built target llama-qwen2vl-cli [ 7%] Building C object ggml/src/CMakeFiles/ggml-base.dir/ggml-quants.c.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-quants.c.o -MF CMakeFiles/ggml-base.dir/ggml-quants.c.o.d -o CMakeFiles/ggml-base.dir/ggml-quants.c.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-quants.c sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 7%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/gguf.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/gguf.cpp.o -MF CMakeFiles/ggml-base.dir/gguf.cpp.o.d -o CMakeFiles/ggml-base.dir/gguf.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/gguf.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 8%] Linking CXX shared library ../../bin/libggml-base.so cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_link_script CMakeFiles/ggml-base.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Xlinker --dependency-file=CMakeFiles/ggml-base.dir/link.d -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libggml-base.so.b6153 -o ../../bin/libggml-base.so.b6153 "CMakeFiles/ggml-base.dir/ggml.c.o" "CMakeFiles/ggml-base.dir/ggml.cpp.o" "CMakeFiles/ggml-base.dir/ggml-alloc.c.o" "CMakeFiles/ggml-base.dir/ggml-backend.cpp.o" "CMakeFiles/ggml-base.dir/ggml-opt.cpp.o" "CMakeFiles/ggml-base.dir/ggml-threading.cpp.o" "CMakeFiles/ggml-base.dir/ggml-quants.c.o" "CMakeFiles/ggml-base.dir/gguf.cpp.o" -lm cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_symlink_library ../../bin/libggml-base.so.b6153 ../../bin/libggml-base.so.b6153 ../../bin/libggml-base.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 8%] Built target ggml-base /usr/bin/gmake -f ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/build.make ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/depend /usr/bin/gmake -f ggml/src/CMakeFiles/ggml-cpu.dir/build.make ggml/src/CMakeFiles/ggml-cpu.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/CMakeFiles/ggml-cpu.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f ggml/src/CMakeFiles/ggml-cpu.dir/build.make ggml/src/CMakeFiles/ggml-cpu.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/build.make ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 8%] Building C object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o [ 8%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o [ 9%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/repack.cpp cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/ggml-cpu.cpp cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/ggml-cpu.c [ 10%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 10%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1010. [ 11%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. [ 12%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/hbm.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. [ 12%] Building C object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/quants.c sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. [ 13%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/traits.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ [ 13%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/amx/amx.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1100. [ 14%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/amx/mmq.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ [ 14%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/binary-ops.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory 4 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx1150. 4 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. [ 15%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/unary-ops.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx1151. 4 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx1152. 4 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx1153. 4 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ [ 15%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/vec.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1200. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ [ 16%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/ops.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory 4 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx900. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 4 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx906. [ 16%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/llamafile/sgemm.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 4 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_In file included from row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int :cc) { 21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ | ^ 4 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. [ 16%] Building C object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/arch/x86/quants.c sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 17%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cpu/arch/x86/repack.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx90a. 4 warnings generated when compiling for gfx942. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx942. 4 warnings generated when compiling for gfx950. 1 warning generated when compiling for gfx950. [ 17%] Linking CXX shared library ../../bin/libggml-cpu.so cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_link_script CMakeFiles/ggml-cpu.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 4 warnings generated when compiling for host. [ 17%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu 1 warning generated when compiling for host. [ 18%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 18%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 2 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Xlinker --dependency-file=CMakeFiles/ggml-cpu.dir/link.d -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libggml-cpu.so.b6153 -o ../../bin/libggml-cpu.so.b6153 "CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o" ../../bin/libggml-base.so.b6153 cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_symlink_library ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-cpu.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 18%] Built target ggml-cpu [ 19%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 2 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuhIn file included from :1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1036. 2 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1031. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1035. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 2 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 2 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1150. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1151. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 2 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | 1 } break; | ^~~~~ warning generated when compiling for host. [ 19%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 20%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu 1 warning generated when compiling for gfx942. 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1012. 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for host. [ 20%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 2 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: 2 warningIn file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ s generated when compiling for gfx1100. 2 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. 2 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 1 warning generated when compiling for gfx1103. 2 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 1 warning generated when compiling for gfx1152. 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx900. 2 warnings generated when compiling for host. [ 21%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx908. In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx950. 1 warning generated when compiling for host. [ 21%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ 2 warnings generated when compiling for host. [ 22%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 1 warning generated when compiling for gfx942. 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1010. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 1 warning generated when compiling for gfx950. 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 1 warning generated when compiling for host. [ 22%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu 2 warnings generated when compiling for gfx1103. 4 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1031. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1151. 4 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 2 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1036. 1 warning generated when compiling for gfx1012. 4 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 1 warning generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1101. 4 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx900. 1 warning generated when compiling for gfx1031. 4 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx1035. 4 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx1036. 4 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ 2 warnings generated when compiling for gfx942. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 4 warnings generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 2 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for host. [ 23%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx900. 4 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 4 warnings generated when compiling for gfx1150. 2 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 4 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 1 warning generated when compiling for gfx1103. 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1152. 1 warning generated when compiling for gfx1036. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for host. [ 23%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu 4 warnings generated when compiling for gfx1153. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 4 warnings generated when compiling for gfx1200. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 4 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1153. 4 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. 4 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 4 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx942. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 1 warning generated when compiling for gfx90a. 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ 1 warning generated when compiling for gfx1201. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx950. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ 1 warning generated when compiling for host. [ 23%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx906. 4 warnings generated when compiling for host. [ 24%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for host. [ 24%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu 11 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, co/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cunst int32_t ne:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 25%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu 7 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1012. 1 warning generated when compiling for gfx1012. 7 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | In file included from tile<16, 8, int> & D, cons/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: t tile<16, 4, int> & /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 22 warnings generated when compiling for gfx1031. 7 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1036. 1 warning generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 22 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 7 warnings generated when compiling for gfx1035. 22 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1102. 1 warning generated when compiling for gfx1031. 7 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1103. In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1150. 11 warnings generated when compiling for gfx1031. 7 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1152. 1 warning generated when compiling for gfx1035. 7 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1200. 7 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 1 warning generated when compiling for gfx1036. 22 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 22 warnings generated when compiling for gfx906. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 7 warnings generated when compiling for gfx1150. 20 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 20 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 7 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 20 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 24 warnings generated when compiling for gfx942. 7 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 11 warnings generated when compiling for gfx1100. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for host. [ 25%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu 1 warning generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 2 warnings generated when compiling for gfx1010. 7 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 2 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 1 warning generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 7 warnings generated when compiling for gfx900. 2 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 2 warnings generated when compiling for gfx1035. 7 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 7 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static boo2 warnings generated when compiling for gfx1100. l fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 1 warning generated when compiling for gfx1150. 7 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 11 warnings generated when compiling for gfx1103. 2 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 2 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1151. 7 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 11 warnings generated when compiling for gfx1150. 7 warnings generated when compiling for gfx950. 2 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 1 warning generated when compiling for gfx1152. 7 warnings generated when compiling for host. 2 warnings generated when compiling for gfx1152. [ 26%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1010. 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1200. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 27 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 2 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 11 warnings generated when compiling for gfx1152. 27 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_availabl/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ e(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, con2st tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh warnings generated when compiling for gfx906. :326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ 2 warnings generated when compiling for gfx908. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 2 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1153. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 27 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1103. 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 27 warnings generated when compiling for gfx1150. 1 warning generated when compiling for gfx900. 2 warnings generated when compiling for gfx950. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 27 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 2 warnings generated when compiling for host. [ 26%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1153. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ 1 warning generated when compiling for gfx1012. 11 warnings generated when compiling for gfx1201. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 27 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx900. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx900. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 25 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 25 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 1 warning generated when compiling for gfx90a. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 25 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 1 warning generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 29 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ 1 warning generated when compiling for gfx1102. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx90a. 27 warnings generated when compiling for host. [ 27%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1153. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for host. [ 27%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 1 warning generated when compiling for gfx908. 3 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx942. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1102. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 11 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 28%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1150. 11 warnings generated when compiling for host. [ 28%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1010. 13 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ 13 warnings generated when compiling for gfx1012. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 13 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 29%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1201. 13 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ 3 warnings generated when compiling for gfx908. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx90a. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 13 warnings generated when compiling for gfx1036. 3 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx950. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 13 warnings generated when compiling for gfx1151. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for host. 13 warnings generated when compiling for gfx1100. [ 29%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuhIn file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 13 warnings generated when compiling for gfx1152. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2>13 & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ warnings generated when compiling for gfx1101. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1200. 13 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx900. 13 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 11 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 11 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1151. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ : warning: unused parameter 'cc' [-Wunused-parameter] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh 270 | static bool fp16_mma_available(const int cc) { | ^ :480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ 15 warnings generated when compiling for gfx942. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for host. [ 30%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu 13 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 13 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 1 warning generated when compiling for gfx1030. 13 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 13 warnings generated when compiling for gfx900. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from 1/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 1 warning generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ 1 warning generated when compiling for gfx1150. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1031. 15 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 13 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for host. [ 30%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for host. [ 31%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 31%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 32%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 13 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 32%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 13 warnings generated when compiling for gfx1031. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 1 warning generated when compiling for host. [ 32%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu 13 warnings generated when compiling for gfx1036. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 1 warning generated when compiling for gfx1010. 13 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 13 warnings generated when compiling for gfx1035. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1150. 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 1 warning generated when compiling for gfx1100. 13 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 1 warning generated when compiling for gfx1101. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuhIn file included from :436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh{ :463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 1 warning generated when compiling for gfx1153. 13 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for host. [ 33%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 33%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 13 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 13 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 34%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx900. 13 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1150. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for host. [ 34%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 35%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. 13 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 35%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 36%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 36%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. 1 warning generated when compiling for host. [ 37%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx1035. 13 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 3 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1150. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1102. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 1 warning generated when compiling for gfx908. 3 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 1 warning generated when compiling for gfx90a. 3 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx90a. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1153. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx942. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for host. [ 37%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 1 warning generated when compiling for gfx1031. 3 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1035. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1036. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx950. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) {In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for host. [ 38%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 38%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 39%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx950. 13 warnings generated when compiling for gfx1153. 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 39%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1010. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 14 warnings generated when compiling for gfx1012. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tiIn file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ le<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1031. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, conIn file included from st tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 40%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1036. 14 warnings generated when compiling for gfx1152. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 1 warning generated when compiling for gfx900. 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) 13 warnings generated when compiling for gfx1201. { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx950. 14 warnings generated when compiling for host. [ 40%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 1 warning generated when compiling for host. [ 41%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8,14 warnings generated when compiling for gfx1030. float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1035. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx1036. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ 14 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 41%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & In file included from D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mm/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuha_available(cons:t int cc) 463{ | :110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] ^ 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | In file included from ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhc:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> onst tile<16& A, const tile<16, 8, int> & B) { | , 4, i ^ nt> & A, const tile/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh<8, 4, int>:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 13 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1036. 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, constIn file included from tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ :419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool :356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ fp/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh16_mma_av:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ ailabl/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, conse(const int cc) { | ^ t tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ , half2> & B) { /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 12 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 41%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 42%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1036. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 13 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ i/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhnt cc) { | ^ :356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, halfIn file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __r2> & De,strict__ sinks_f, | ^ const tile<16, 8, h/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ alf2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 42%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16,In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ vIn file included from oid cp_async_wait_all() { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tilIn file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, inte<16, > & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh8, flo:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, hat> & D, aclf2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ onst/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh tile<:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 16/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh, 8, nv_bfl:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ oat162> & A,/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh const tile:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ fl/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhoat162> &:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ B) { /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, | ^ const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1036. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, In file included from | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for host. [ 43%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | In file included from tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhi:356:nt 96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for host. [ 43%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1036. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ :43: warning: unused parameter 'sinks_f' [-Wunused-parameter]/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ 302 | tile<16, /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ :1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ 1257 | /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh: 436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ cons/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ t int * __r/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ estrict/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ __ KV_m/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhax, | ^ :544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for host. [ 44%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & 14 warnings generated when compiling for gfx1103. A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 44%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cuIn file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ :3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhIn file included from :326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const :t356:ile<8,96: 4, int> & B) { | ^ warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh: tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh326:90: warning: :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ 326 | /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ :/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | til:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ e/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh<16, 8, fl:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ o/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhat> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ :419/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ :96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max,In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1030. In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu::33: : In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::11: : /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh::270270::4242:: warning: warning: unused parameter 'cc' [-Wunused-parameter]unused parameter 'cc' [-Wunused-parameter] 270270 | | ssttaattiicc bbooooll ffpp1166__mmmmaa__aavvaaiillaabbllee((ccoonnsstt iinntt cccc)) {{ | | ^ ^ In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu::33: : In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::22: : /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh::5151::6060:: warning: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 5151 | | ssttaattiicc ____ddeevviiccee____ ____ffoorrcceeiinnlliinnee____ vvooiidd ccpp__aassyynncc__wwaaiitt__aallll(()) {{ | | ^ ^ In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu::33: : In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::33: : /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::302302::9090:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302302 | | ttiillee<<1166,, 88,, iinntt>> && DD,, ccoonnsstt ttiillee<<1166,, 44,, iinntt>> && AA,, ccoonnsstt ttiillee<<88,, 44,, iinntt> >& & BB)) {{ | | ^ ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::326326::9090:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326326 | | ttiillee<<1166,, 88,, iinntt>> && DD,, ccoonnsstt ttiillee<<1166,, 88,, iinntt>> && AA,, ccoonnsstt ttiillee<<88,, 88,, iinntt>> && BB)) {{ | | ^ ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::356356::9696:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356356 | | ttiillee<<1166,, 44,, hhaallff22>> && DD,, ccoonnsstt ttiillee<<1166,, 88,, hhaallff22>> && AA,, ccoonnsstt ttiillee<<88,, 88,, hhaallff22>> && BB)) {{ | | ^ ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::383383::9797:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383383 | | ttiillee<<1166,, 88,, hhaallff22>> && DD,, ccoonnsstt ttiillee<<1166,, 88,, hhaallff22>> && AA,, ccoonnsstt ttiillee<<1166,, 88,, hhaallff22>> && BB)) {{ | | ^ ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::419419::9696:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419419 | | ttiillee<<1166,, 88,, ffllooaatt>> && DD,, ccoonnsstt ttiillee<<1166,, 88,, ffllooaatt>> && AA,, ccoonnsstt ttiillee<<88,, 88,, ffllooaatt>> && BB)) {{ | | ^ ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436::43696::96 :warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436436 | | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const ti/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhle:<4638:,110 :8 ,warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]n v_bfloat162> & B) { 463| | ^ tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu::33: : /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::788788::4343:: warning: warning: unused parameter 'sinks_f' [-Wunused-parameter]unused parameter 'sinks_f' [-Wunused-parameter] 788788 | | ccoonnsstt ffllooaatt ** ccoonnsstt ____rreessttrriicctt____ ssiinnkkss__ff,, | | ^ ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu14 warnings generated when compiling for gfx906. :3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 45%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1036. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nIn file included from v_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ 270 | static/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh bool fp16_mma_available(const int cc) { | ^ :544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 45%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 11 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tiIn file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ le<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1103. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1150. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 46%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 14 warnings generated when compiling for host. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ [ 46%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 14 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 47%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3tile<: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ 788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh 788 | const float * const __restrict__ sinks_f, | ^ :326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ 43: warning: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhunused parameter 'sinks_f' [-Wunused-parameter] :436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ 788 | /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ const/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ floa/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuht:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1151. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | In file included from ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & bool fp16B) { | ^ _mma_available(co/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhn:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ st int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8,In file included from int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh__ sinks_f, | ^ :326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx900. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 47%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for host. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ [ 48%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, intIn file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ > & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 1257 | /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ con/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ st in/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuht * __restrict__ KV_max, | ^ :480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ [ 48%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1153. 13 warnings generated when compiling for host. [ 49%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, consIn file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ t tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tiIn file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ le<16, 4, int> &/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx900. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for host. [ 49%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 12 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 14 warnings generated when compiling for host. [ 50%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3027:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 3027 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3027:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 3027 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3027:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 3027 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3027:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 3027 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2987:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2987 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2987:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2987 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 50%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2987:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2987 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 14 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 14 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2987:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2987 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 50%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 51%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 51%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. 12 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float>12 warnings generated when compiling for gfx1036. & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhIn file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ :356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. 12 warnings generated when compiling for gfx1150. 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3019:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3019 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3019:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3019 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3011:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3011 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3019:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3019 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3011:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3011 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3035:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3035 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3019:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3019 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3011:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3011 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3043:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3043 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3035:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3035 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3011:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3011 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3043:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3043 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3035:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3035 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3043:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 3043 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for host. [ 52%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3035:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3035 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx950. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3043:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3043 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for host. [ 52%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. 12 warnings generated when compiling for host. [ 53%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu 12 warnings generated when compiling for host. [ 53%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int>In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ & B/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ ) { /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ | /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2939:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2939 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2939:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2939 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2939:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2939 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2899:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_DS4>' requested here 2899 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2939:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2939 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2899:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_DS4>' requested here 2899 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2899:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_DS4>' requested here 2899 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2899:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_DS4>' requested here 2899 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 14 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 54%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 54%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<112 warnings generated when compiling for gfx1012. 6, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 55%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2907:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2907 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2907:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2907 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2907:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2907 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2907:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2907 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2915:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2915 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2915:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2915 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 55%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2915:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2915 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2915:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2915 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, false>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2963:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2963 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, true>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3578:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3578 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx1031. 13 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, false>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2963:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2963 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, true>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3578:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3578 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. 13 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, false>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2963:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2963 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, true>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3578:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3578 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1036. 13 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, false>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2963:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2963 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, true>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3578:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3578 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 56%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const t17ile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ warnings generated when compiling for gfx942. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 56%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tileIn file included from <16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ 5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ :302:90/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ : warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2923:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2923 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2923:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2923 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2923:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2923 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2923:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2923 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 57%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 57%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. 28 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2971:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2971 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 28 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1035. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2971:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2971 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2971:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2971 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 28 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2971:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2971 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 15 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx1152. 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 58%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 28 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 14 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2931:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2931 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 28 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2931:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2931 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 28 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 28 warnings generated when compiling for host. [ 58%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 59%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1012. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2931:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2931 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2931:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2931 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1100. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 28 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx950. 28 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ : /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for host. [ 59%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1151. 8 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1152. 8 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1031. 28 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1035. 28 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 8 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx950. /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 59%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1201. 8 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx900. 8 warnings generated when compiling for gfx1010. 8 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1035. 8 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx950. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for host. [ 60%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1010. 8 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx942. 8 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx950. 8 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for host. [ 60%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1103. 8 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1150. 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for host. [ 61%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu 8 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1200. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for host. [ 61%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_HIP_ROCWMMA_FATTN_GFX12 -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 8 warnings generated when compiling for gfx900. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx1036. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 8 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx90a. 8 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for host. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 62%] Linking CXX shared library ../../../bin/libggml-hip.so cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/cmake -E cmake_link_script CMakeFiles/ggml-hip.dir/link.txt --verbose=1 /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Xlinker --dependency-file=CMakeFiles/ggml-hip.dir/link.d -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libggml-hip.so.b6153 -o ../../../bin/libggml-hip.so.b6153 "CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o" ../../../bin/libggml-base.so.b6153 /usr/lib64/libhipblas.so.3.1 --hip-link --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offclang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] load-arch=gfx1035 --offload-arch=gfx1036 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 /usr/lib64/librocblas.so.5.1 /usr/lib64/libamdhip64.so.7.1.52802 cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/cmake -E cmake_symlink_library ../../../bin/libggml-hip.so.b6153 ../../../bin/libggml-hip.so.b6153 ../../../bin/libggml-hip.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 62%] Built target ggml-hip /usr/bin/gmake -f ggml/src/CMakeFiles/ggml.dir/build.make ggml/src/CMakeFiles/ggml.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src/CMakeFiles/ggml.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f ggml/src/CMakeFiles/ggml.dir/build.make ggml/src/CMakeFiles/ggml.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 62%] Building CXX object ggml/src/CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_BUILD -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o -MF CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o.d -o CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/ggml-backend-reg.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 63%] Linking CXX shared library ../../bin/libggml.so cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_link_script CMakeFiles/ggml.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Xlinker --dependency-file=CMakeFiles/ggml.dir/link.d -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libggml.so.b6153 -o ../../bin/libggml.so.b6153 "CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o" -ldl ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_symlink_library ../../bin/libggml.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 63%] Built target ggml /usr/bin/gmake -f src/CMakeFiles/llama.dir/build.make src/CMakeFiles/llama.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src/CMakeFiles/llama.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f src/CMakeFiles/llama.dir/build.make src/CMakeFiles/llama.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 63%] Building CXX object src/CMakeFiles/llama.dir/llama.cpp.o [ 63%] Building CXX object src/CMakeFiles/llama.dir/llama-arch.cpp.o [ 65%] Building CXX object src/CMakeFiles/llama.dir/llama-batch.cpp.o [ 65%] Building CXX object src/CMakeFiles/llama.dir/llama-adapter.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama.cpp.o -MF CMakeFiles/llama.dir/llama.cpp.o.d -o CMakeFiles/llama.dir/llama.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama.cpp cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-adapter.cpp.o -MF CMakeFiles/llama.dir/llama-adapter.cpp.o.d -o CMakeFiles/llama.dir/llama-adapter.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-adapter.cpp cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-arch.cpp.o -MF CMakeFiles/llama.dir/llama-arch.cpp.o.d -o CMakeFiles/llama.dir/llama-arch.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-arch.cpp cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-batch.cpp.o -MF CMakeFiles/llama.dir/llama-batch.cpp.o.d -o CMakeFiles/llama.dir/llama-batch.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-batch.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 65%] Building CXX object src/CMakeFiles/llama.dir/llama-chat.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-chat.cpp.o -MF CMakeFiles/llama.dir/llama-chat.cpp.o.d -o CMakeFiles/llama.dir/llama-chat.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-chat.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 66%] Building CXX object src/CMakeFiles/llama.dir/llama-context.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-context.cpp.o -MF CMakeFiles/llama.dir/llama-context.cpp.o.d -o CMakeFiles/llama.dir/llama-context.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-context.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 66%] Building CXX object src/CMakeFiles/llama.dir/llama-cparams.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-cparams.cpp.o -MF CMakeFiles/llama.dir/llama-cparams.cpp.o.d -o CMakeFiles/llama.dir/llama-cparams.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-cparams.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 67%] Building CXX object src/CMakeFiles/llama.dir/llama-grammar.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-grammar.cpp.o -MF CMakeFiles/llama.dir/llama-grammar.cpp.o.d -o CMakeFiles/llama.dir/llama-grammar.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-grammar.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 67%] Building CXX object src/CMakeFiles/llama.dir/llama-graph.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-graph.cpp.o -MF CMakeFiles/llama.dir/llama-graph.cpp.o.d -o CMakeFiles/llama.dir/llama-graph.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-graph.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 68%] Building CXX object src/CMakeFiles/llama.dir/llama-hparams.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-hparams.cpp.o -MF CMakeFiles/llama.dir/llama-hparams.cpp.o.d -o CMakeFiles/llama.dir/llama-hparams.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-hparams.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 68%] Building CXX object src/CMakeFiles/llama.dir/llama-impl.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-impl.cpp.o -MF CMakeFiles/llama.dir/llama-impl.cpp.o.d -o CMakeFiles/llama.dir/llama-impl.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-impl.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 69%] Building CXX object src/CMakeFiles/llama.dir/llama-io.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-io.cpp.o -MF CMakeFiles/llama.dir/llama-io.cpp.o.d -o CMakeFiles/llama.dir/llama-io.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-io.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 69%] Building CXX object src/CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o -MF CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o.d -o CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-kv-cache-unified.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 69%] Building CXX object src/CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o -MF CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o.d -o CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-kv-cache-unified-iswa.cpp [ 70%] Building CXX object src/CMakeFiles/llama.dir/llama-memory.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-memory.cpp.o -MF CMakeFiles/llama.dir/llama-memory.cpp.o.d -o CMakeFiles/llama.dir/llama-memory.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-memory.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 70%] Building CXX object src/CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o -MF CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o.d -o CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-memory-hybrid.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 71%] Building CXX object src/CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o -MF CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o.d -o CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-memory-recurrent.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 71%] Building CXX object src/CMakeFiles/llama.dir/llama-mmap.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-mmap.cpp.o -MF CMakeFiles/llama.dir/llama-mmap.cpp.o.d -o CMakeFiles/llama.dir/llama-mmap.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-mmap.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 72%] Building CXX object src/CMakeFiles/llama.dir/llama-model-loader.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-model-loader.cpp.o -MF CMakeFiles/llama.dir/llama-model-loader.cpp.o.d -o CMakeFiles/llama.dir/llama-model-loader.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-model-loader.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 72%] Building CXX object src/CMakeFiles/llama.dir/llama-model-saver.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-model-saver.cpp.o -MF CMakeFiles/llama.dir/llama-model-saver.cpp.o.d -o CMakeFiles/llama.dir/llama-model-saver.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-model-saver.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 73%] Building CXX object src/CMakeFiles/llama.dir/llama-model.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-model.cpp.o -MF CMakeFiles/llama.dir/llama-model.cpp.o.d -o CMakeFiles/llama.dir/llama-model.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-model.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 73%] Building CXX object src/CMakeFiles/llama.dir/llama-quant.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-quant.cpp.o -MF CMakeFiles/llama.dir/llama-quant.cpp.o.d -o CMakeFiles/llama.dir/llama-quant.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-quant.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 74%] Building CXX object src/CMakeFiles/llama.dir/llama-sampling.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-sampling.cpp.o -MF CMakeFiles/llama.dir/llama-sampling.cpp.o.d -o CMakeFiles/llama.dir/llama-sampling.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-sampling.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 74%] Building CXX object src/CMakeFiles/llama.dir/llama-vocab.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-vocab.cpp.o -MF CMakeFiles/llama.dir/llama-vocab.cpp.o.d -o CMakeFiles/llama.dir/llama-vocab.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/llama-vocab.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 75%] Building CXX object src/CMakeFiles/llama.dir/unicode-data.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/unicode-data.cpp.o -MF CMakeFiles/llama.dir/unicode-data.cpp.o.d -o CMakeFiles/llama.dir/unicode-data.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/unicode-data.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 75%] Building CXX object src/CMakeFiles/llama.dir/unicode.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/unicode.cpp.o -MF CMakeFiles/llama.dir/unicode.cpp.o.d -o CMakeFiles/llama.dir/unicode.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/unicode.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 76%] Linking CXX shared library ../bin/libllama.so cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Xlinker --dependency-file=CMakeFiles/llama.dir/link.d -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libllama.so.b6153 -o ../bin/libllama.so.b6153 CMakeFiles/llama.dir/llama.cpp.o "CMakeFiles/llama.dir/llama-adapter.cpp.o" "CMakeFiles/llama.dir/llama-arch.cpp.o" "CMakeFiles/llama.dir/llama-batch.cpp.o" "CMakeFiles/llama.dir/llama-chat.cpp.o" "CMakeFiles/llama.dir/llama-context.cpp.o" "CMakeFiles/llama.dir/llama-cparams.cpp.o" "CMakeFiles/llama.dir/llama-grammar.cpp.o" "CMakeFiles/llama.dir/llama-graph.cpp.o" "CMakeFiles/llama.dir/llama-hparams.cpp.o" "CMakeFiles/llama.dir/llama-impl.cpp.o" "CMakeFiles/llama.dir/llama-io.cpp.o" "CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o" "CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o" "CMakeFiles/llama.dir/llama-memory.cpp.o" "CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o" "CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o" "CMakeFiles/llama.dir/llama-mmap.cpp.o" "CMakeFiles/llama.dir/llama-model-loader.cpp.o" "CMakeFiles/llama.dir/llama-model-saver.cpp.o" "CMakeFiles/llama.dir/llama-model.cpp.o" "CMakeFiles/llama.dir/llama-quant.cpp.o" "CMakeFiles/llama.dir/llama-sampling.cpp.o" "CMakeFiles/llama.dir/llama-vocab.cpp.o" "CMakeFiles/llama.dir/unicode-data.cpp.o" CMakeFiles/llama.dir/unicode.cpp.o ../bin/libggml.so.b6153 ../bin/libggml-cpu.so.b6153 ../bin/libggml-hip.so.b6153 ../bin/libggml-base.so.b6153 cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/cmake -E cmake_symlink_library ../bin/libllama.so.b6153 ../bin/libllama.so.b6153 ../bin/libllama.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 76%] Built target llama /usr/bin/gmake -f common/CMakeFiles/common.dir/build.make common/CMakeFiles/common.dir/depend /usr/bin/gmake -f tools/mtmd/CMakeFiles/mtmd.dir/build.make tools/mtmd/CMakeFiles/mtmd.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common/CMakeFiles/common.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/mtmd.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/mtmd.dir/build.make tools/mtmd/CMakeFiles/mtmd.dir/build gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f common/CMakeFiles/common.dir/build.make common/CMakeFiles/common.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 78%] Building CXX object tools/mtmd/CMakeFiles/mtmd.dir/clip.cpp.o [ 78%] Building CXX object tools/mtmd/CMakeFiles/mtmd.dir/mtmd.cpp.o [ 78%] Building CXX object tools/mtmd/CMakeFiles/mtmd.dir/mtmd-audio.cpp.o [ 78%] Building CXX object common/CMakeFiles/common.dir/arg.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dmtmd_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/../.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/../../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -Wno-cast-qual -MD -MT tools/mtmd/CMakeFiles/mtmd.dir/mtmd.cpp.o -MF CMakeFiles/mtmd.dir/mtmd.cpp.o.d -o CMakeFiles/mtmd.dir/mtmd.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/mtmd.cpp cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dmtmd_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/../.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/../../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -Wno-cast-qual -MD -MT tools/mtmd/CMakeFiles/mtmd.dir/clip.cpp.o -MF CMakeFiles/mtmd.dir/clip.cpp.o.d -o CMakeFiles/mtmd.dir/clip.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/clip.cpp cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dmtmd_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/../.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/../../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -Wno-cast-qual -MD -MT tools/mtmd/CMakeFiles/mtmd.dir/mtmd-audio.cpp.o -MF CMakeFiles/mtmd.dir/mtmd-audio.cpp.o.d -o CMakeFiles/mtmd.dir/mtmd-audio.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/mtmd-audio.cpp cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/arg.cpp.o -MF CMakeFiles/common.dir/arg.cpp.o.d -o CMakeFiles/common.dir/arg.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/arg.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 78%] Building CXX object tools/mtmd/CMakeFiles/mtmd.dir/mtmd-helper.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dmtmd_EXPORTS -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/../.. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/../../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -Wno-cast-qual -MD -MT tools/mtmd/CMakeFiles/mtmd.dir/mtmd-helper.cpp.o -MF CMakeFiles/mtmd.dir/mtmd-helper.cpp.o.d -o CMakeFiles/mtmd.dir/mtmd-helper.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/mtmd-helper.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 79%] Building CXX object common/CMakeFiles/common.dir/chat-parser.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/chat-parser.cpp.o -MF CMakeFiles/common.dir/chat-parser.cpp.o.d -o CMakeFiles/common.dir/chat-parser.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/chat-parser.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 79%] Building CXX object common/CMakeFiles/common.dir/chat.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/chat.cpp.o -MF CMakeFiles/common.dir/chat.cpp.o.d -o CMakeFiles/common.dir/chat.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/chat.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 80%] Building CXX object common/CMakeFiles/common.dir/common.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/common.cpp.o -MF CMakeFiles/common.dir/common.cpp.o.d -o CMakeFiles/common.dir/common.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/common.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 81%] Linking CXX shared library ../../bin/libmtmd.so cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/mtmd.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 81%] Building CXX object common/CMakeFiles/common.dir/console.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/console.cpp.o -MF CMakeFiles/common.dir/console.cpp.o.d -o CMakeFiles/common.dir/console.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/console.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Xlinker --dependency-file=CMakeFiles/mtmd.dir/link.d -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libmtmd.so.b6153 -o ../../bin/libmtmd.so.b6153 CMakeFiles/mtmd.dir/mtmd.cpp.o "CMakeFiles/mtmd.dir/mtmd-audio.cpp.o" CMakeFiles/mtmd.dir/clip.cpp.o "CMakeFiles/mtmd.dir/mtmd-helper.cpp.o" ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_symlink_library ../../bin/libmtmd.so.b6153 ../../bin/libmtmd.so.b6153 ../../bin/libmtmd.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 81%] Built target mtmd [ 82%] Building CXX object common/CMakeFiles/common.dir/json-partial.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/json-partial.cpp.o -MF CMakeFiles/common.dir/json-partial.cpp.o.d -o CMakeFiles/common.dir/json-partial.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/json-partial.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 82%] Building CXX object common/CMakeFiles/common.dir/json-schema-to-grammar.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/json-schema-to-grammar.cpp.o -MF CMakeFiles/common.dir/json-schema-to-grammar.cpp.o.d -o CMakeFiles/common.dir/json-schema-to-grammar.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/json-schema-to-grammar.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 83%] Building CXX object common/CMakeFiles/common.dir/llguidance.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/llguidance.cpp.o -MF CMakeFiles/common.dir/llguidance.cpp.o.d -o CMakeFiles/common.dir/llguidance.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/llguidance.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 83%] Building CXX object common/CMakeFiles/common.dir/log.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/log.cpp.o -MF CMakeFiles/common.dir/log.cpp.o.d -o CMakeFiles/common.dir/log.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/log.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 84%] Building CXX object common/CMakeFiles/common.dir/ngram-cache.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/ngram-cache.cpp.o -MF CMakeFiles/common.dir/ngram-cache.cpp.o.d -o CMakeFiles/common.dir/ngram-cache.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/ngram-cache.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 84%] Building CXX object common/CMakeFiles/common.dir/regex-partial.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/regex-partial.cpp.o -MF CMakeFiles/common.dir/regex-partial.cpp.o.d -o CMakeFiles/common.dir/regex-partial.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/regex-partial.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 85%] Building CXX object common/CMakeFiles/common.dir/sampling.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/sampling.cpp.o -MF CMakeFiles/common.dir/sampling.cpp.o.d -o CMakeFiles/common.dir/sampling.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/sampling.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 85%] Building CXX object common/CMakeFiles/common.dir/speculative.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/speculative.cpp.o -MF CMakeFiles/common.dir/speculative.cpp.o.d -o CMakeFiles/common.dir/speculative.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/speculative.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 86%] Linking CXX static library libcommon.a cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/cmake -P CMakeFiles/common.dir/cmake_clean_target.cmake cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/cmake -E cmake_link_script CMakeFiles/common.dir/link.txt --verbose=1 bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record /usr/bin/ar qc libcommon.a CMakeFiles/common.dir/arg.cpp.o "CMakeFiles/common.dir/chat-parser.cpp.o" CMakeFiles/common.dir/chat.cpp.o CMakeFiles/common.dir/common.cpp.o CMakeFiles/common.dir/console.cpp.o "CMakeFiles/common.dir/json-partial.cpp.o" "CMakeFiles/common.dir/json-schema-to-grammar.cpp.o" CMakeFiles/common.dir/llguidance.cpp.o CMakeFiles/common.dir/log.cpp.o "CMakeFiles/common.dir/ngram-cache.cpp.o" "CMakeFiles/common.dir/regex-partial.cpp.o" CMakeFiles/common.dir/sampling.cpp.o CMakeFiles/common.dir/speculative.cpp.o "CMakeFiles/build_info.dir/build-info.cpp.o" /usr/bin/ranlib libcommon.a gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 86%] Built target common /usr/bin/gmake -f tools/batched-bench/CMakeFiles/llama-batched-bench.dir/build.make tools/batched-bench/CMakeFiles/llama-batched-bench.dir/depend /usr/bin/gmake -f tools/gguf-split/CMakeFiles/llama-gguf-split.dir/build.make tools/gguf-split/CMakeFiles/llama-gguf-split.dir/depend /usr/bin/gmake -f tools/imatrix/CMakeFiles/llama-imatrix.dir/build.make tools/imatrix/CMakeFiles/llama-imatrix.dir/depend /usr/bin/gmake -f tools/llama-bench/CMakeFiles/llama-bench.dir/build.make tools/llama-bench/CMakeFiles/llama-bench.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/batched-bench /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/batched-bench /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/batched-bench/CMakeFiles/llama-batched-bench.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/gguf-split /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/gguf-split /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/gguf-split/CMakeFiles/llama-gguf-split.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/imatrix /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/imatrix /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/imatrix/CMakeFiles/llama-imatrix.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/llama-bench /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/llama-bench /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/llama-bench/CMakeFiles/llama-bench.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/gguf-split/CMakeFiles/llama-gguf-split.dir/build.make tools/gguf-split/CMakeFiles/llama-gguf-split.dir/build gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/imatrix/CMakeFiles/llama-imatrix.dir/build.make tools/imatrix/CMakeFiles/llama-imatrix.dir/build /usr/bin/gmake -f tools/batched-bench/CMakeFiles/llama-batched-bench.dir/build.make tools/batched-bench/CMakeFiles/llama-batched-bench.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/llama-bench/CMakeFiles/llama-bench.dir/build.make tools/llama-bench/CMakeFiles/llama-bench.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 87%] Building CXX object tools/imatrix/CMakeFiles/llama-imatrix.dir/imatrix.cpp.o [ 87%] Building CXX object tools/batched-bench/CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o [ 88%] Building CXX object tools/gguf-split/CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/imatrix && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/imatrix/CMakeFiles/llama-imatrix.dir/imatrix.cpp.o -MF CMakeFiles/llama-imatrix.dir/imatrix.cpp.o.d -o CMakeFiles/llama-imatrix.dir/imatrix.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/imatrix/imatrix.cpp cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/batched-bench && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/batched-bench/CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o -MF CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o.d -o CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/batched-bench/batched-bench.cpp [ 88%] Building CXX object tools/llama-bench/CMakeFiles/llama-bench.dir/llama-bench.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/llama-bench && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/llama-bench/CMakeFiles/llama-bench.dir/llama-bench.cpp.o -MF CMakeFiles/llama-bench.dir/llama-bench.cpp.o.d -o CMakeFiles/llama-bench.dir/llama-bench.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/llama-bench/llama-bench.cpp cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/gguf-split && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/gguf-split/CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o -MF CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o.d -o CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/gguf-split/gguf-split.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 88%] Linking CXX executable ../../bin/llama-gguf-split cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/gguf-split && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-gguf-split.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 89%] Linking CXX executable ../../bin/llama-batched-bench cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/batched-bench && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-batched-bench.dir/link.txt --verbose=1 clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-gguf-split.dir/link.d "CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o" -o ../../bin/llama-gguf-split ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 89%] Built target llama-gguf-split /usr/bin/gmake -f tools/main/CMakeFiles/llama-cli.dir/build.make tools/main/CMakeFiles/llama-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/main /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/main /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/main/CMakeFiles/llama-cli.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/main/CMakeFiles/llama-cli.dir/build.make tools/main/CMakeFiles/llama-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 89%] Building CXX object tools/main/CMakeFiles/llama-cli.dir/main.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/main && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/main/CMakeFiles/llama-cli.dir/main.cpp.o -MF CMakeFiles/llama-cli.dir/main.cpp.o.d -o CMakeFiles/llama-cli.dir/main.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/main/main.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 89%] Linking CXX executable ../../bin/llama-cli cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/main && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-cli.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 89%] Linking CXX executable ../../bin/llama-imatrix cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/imatrix && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-imatrix.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 90%] Linking CXX executable ../../bin/llama-bench cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/llama-bench && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-bench.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-bench.dir/link.d "CMakeFiles/llama-bench.dir/llama-bench.cpp.o" -o ../../bin/llama-bench ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 90%] Built target llama-bench /usr/bin/gmake -f tools/perplexity/CMakeFiles/llama-perplexity.dir/build.make tools/perplexity/CMakeFiles/llama-perplexity.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/perplexity /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/perplexity /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/perplexity/CMakeFiles/llama-perplexity.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/perplexity/CMakeFiles/llama-perplexity.dir/build.make tools/perplexity/CMakeFiles/llama-perplexity.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 91%] Building CXX object tools/perplexity/CMakeFiles/llama-perplexity.dir/perplexity.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/perplexity && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/perplexity/CMakeFiles/llama-perplexity.dir/perplexity.cpp.o -MF CMakeFiles/llama-perplexity.dir/perplexity.cpp.o.d -o CMakeFiles/llama-perplexity.dir/perplexity.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/perplexity/perplexity.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 91%] Linking CXX executable ../../bin/llama-perplexity cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/perplexity && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-perplexity.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-batched-bench.dir/link.d "CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o" -o ../../bin/llama-batched-bench ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 91%] Built target llama-batched-bench /usr/bin/gmake -f tools/quantize/CMakeFiles/llama-quantize.dir/build.make tools/quantize/CMakeFiles/llama-quantize.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/quantize /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/quantize /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/quantize/CMakeFiles/llama-quantize.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/quantize/CMakeFiles/llama-quantize.dir/build.make tools/quantize/CMakeFiles/llama-quantize.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 91%] Building CXX object tools/quantize/CMakeFiles/llama-quantize.dir/quantize.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/quantize && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/quantize/../../common -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/quantize/CMakeFiles/llama-quantize.dir/quantize.cpp.o -MF CMakeFiles/llama-quantize.dir/quantize.cpp.o.d -o CMakeFiles/llama-quantize.dir/quantize.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/quantize/quantize.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 92%] Linking CXX executable ../../bin/llama-quantize cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/quantize && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-quantize.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-cli.dir/link.d "CMakeFiles/llama-cli.dir/main.cpp.o" -o ../../bin/llama-cli ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 92%] Built target llama-cli /usr/bin/gmake -f tools/server/CMakeFiles/llama-server.dir/build.make tools/server/CMakeFiles/llama-server.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 92%] Generating loading.html.hpp cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/server && /usr/bin/cmake -DINPUT=/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/server/public/loading.html -DOUTPUT=/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/server/loading.html.hpp -P /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/scripts/xxd.cmake [ 93%] Generating index.html.gz.hpp cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/server && /usr/bin/cmake -DINPUT=/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/server/public/index.html.gz -DOUTPUT=/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/server/index.html.gz.hpp -P /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/scripts/xxd.cmake /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-quantize.dir/link.d "CMakeFiles/llama-quantize.dir/quantize.cpp.o" -o ../../bin/llama-quantize ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 93%] Built target llama-quantize /usr/bin/gmake -f tools/run/CMakeFiles/llama-run.dir/build.make tools/run/CMakeFiles/llama-run.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/run /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/run /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/run/CMakeFiles/llama-run.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/run/CMakeFiles/llama-run.dir/build.make tools/run/CMakeFiles/llama-run.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 93%] Building CXX object tools/run/CMakeFiles/llama-run.dir/run.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/run && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/run/CMakeFiles/llama-run.dir/run.cpp.o -MF CMakeFiles/llama-run.dir/run.cpp.o.d -o CMakeFiles/llama-run.dir/run.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/run/run.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/server /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/server /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/server/CMakeFiles/llama-server.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/server/CMakeFiles/llama-server.dir/build.make tools/server/CMakeFiles/llama-server.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 94%] Building CXX object tools/server/CMakeFiles/llama-server.dir/server.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/server && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/server -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/server -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/server/../llava -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/. -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/server/CMakeFiles/llama-server.dir/server.cpp.o -MF CMakeFiles/llama-server.dir/server.cpp.o.d -o CMakeFiles/llama-server.dir/server.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/server/server.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-imatrix.dir/link.d "CMakeFiles/llama-imatrix.dir/imatrix.cpp.o" -o ../../bin/llama-imatrix ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 94%] Built target llama-imatrix /usr/bin/gmake -f tools/tokenize/CMakeFiles/llama-tokenize.dir/build.make tools/tokenize/CMakeFiles/llama-tokenize.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/tokenize /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/tokenize /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/tokenize/CMakeFiles/llama-tokenize.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/tokenize/CMakeFiles/llama-tokenize.dir/build.make tools/tokenize/CMakeFiles/llama-tokenize.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 95%] Building CXX object tools/tokenize/CMakeFiles/llama-tokenize.dir/tokenize.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/tokenize && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/tokenize/CMakeFiles/llama-tokenize.dir/tokenize.cpp.o -MF CMakeFiles/llama-tokenize.dir/tokenize.cpp.o.d -o CMakeFiles/llama-tokenize.dir/tokenize.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/tokenize/tokenize.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 95%] Linking CXX executable ../../bin/llama-tokenize cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/tokenize && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-tokenize.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-tokenize.dir/link.d "CMakeFiles/llama-tokenize.dir/tokenize.cpp.o" -o ../../bin/llama-tokenize ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 95%] Built target llama-tokenize /usr/bin/gmake -f tools/tts/CMakeFiles/llama-tts.dir/build.make tools/tts/CMakeFiles/llama-tts.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/tts /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/tts /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/tts/CMakeFiles/llama-tts.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/tts/CMakeFiles/llama-tts.dir/build.make tools/tts/CMakeFiles/llama-tts.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 96%] Building CXX object tools/tts/CMakeFiles/llama-tts.dir/tts.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/tts && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/tts/CMakeFiles/llama-tts.dir/tts.cpp.o -MF CMakeFiles/llama-tts.dir/tts.cpp.o.d -o CMakeFiles/llama-tts.dir/tts.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/tts/tts.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 97%] Building CXX object tools/run/CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/run && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/run/CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o -MF CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o.d -o CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/run/linenoise.cpp/linenoise.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 97%] Linking CXX executable ../../bin/llama-run cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/run && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-run.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-perplexity.dir/link.d "CMakeFiles/llama-perplexity.dir/perplexity.cpp.o" -o ../../bin/llama-perplexity ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 97%] Built target llama-perplexity /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/build.make tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/build.make tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 98%] Building CXX object tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/. -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o -MF CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o.d -o CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/mtmd/mtmd-cli.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 98%] Linking CXX executable ../../bin/llama-mtmd-cli cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-mtmd-cli.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 98%] Linking CXX executable ../../bin/llama-tts cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/tts && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-tts.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-run.dir/link.d "CMakeFiles/llama-run.dir/run.cpp.o" "CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o" -o ../../bin/llama-run ../../common/libcommon.a ../../bin/libllama.so.b6153 /usr/lib64/libcurl.so ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 98%] Built target llama-run /usr/bin/gmake -f tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/build.make tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/cvector-generator /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/cvector-generator /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/build.make tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 99%] Building CXX object tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/cvector-generator && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o -MF CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o.d -o CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/cvector-generator/cvector-generator.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 99%] Linking CXX executable ../../bin/llama-cvector-generator cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/cvector-generator && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-cvector-generator.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-mtmd-cli.dir/link.d "CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o" -o ../../bin/llama-mtmd-cli ../../common/libcommon.a ../../bin/libmtmd.so.b6153 /usr/lib64/libcurl.so ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [ 99%] Built target llama-mtmd-cli /usr/bin/gmake -f tools/export-lora/CMakeFiles/llama-export-lora.dir/build.make tools/export-lora/CMakeFiles/llama-export-lora.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/export-lora /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/export-lora /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/export-lora/CMakeFiles/llama-export-lora.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/export-lora/CMakeFiles/llama-export-lora.dir/build.make tools/export-lora/CMakeFiles/llama-export-lora.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [100%] Building CXX object tools/export-lora/CMakeFiles/llama-export-lora.dir/export-lora.cpp.o cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/export-lora && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/export-lora/CMakeFiles/llama-export-lora.dir/export-lora.cpp.o -MF CMakeFiles/llama-export-lora.dir/export-lora.cpp.o.d -o CMakeFiles/llama-export-lora.dir/export-lora.cpp.o -c /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/tools/export-lora/export-lora.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [100%] Linking CXX executable ../../bin/llama-export-lora cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/export-lora && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-export-lora.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-tts.dir/link.d "CMakeFiles/llama-tts.dir/tts.cpp.o" -o ../../bin/llama-tts ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [100%] Built target llama-tts [100%] Linking CXX executable ../../bin/llama-server cd /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/tools/server && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-server.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-cvector-generator.dir/link.d "CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o" -o ../../bin/llama-cvector-generator ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [100%] Built target llama-cvector-generator /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-export-lora.dir/link.d "CMakeFiles/llama-export-lora.dir/export-lora.cpp.o" -o ../../bin/llama-export-lora ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [100%] Built target llama-export-lora /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -Xlinker --dependency-file=CMakeFiles/llama-server.dir/link.d "CMakeFiles/llama-server.dir/server.cpp.o" -o ../../bin/llama-server ../../common/libcommon.a ../../bin/libmtmd.so.b6153 /usr/lib64/libcurl.so ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 gmake[2]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' [100%] Built target llama-server gmake[1]: Leaving directory '/builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build' /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/redhat-linux-build/CMakeFiles 0 + RPM_EC=0 ++ jobs -p + exit 0 Executing(%install): /bin/sh -e /var/tmp/rpm-tmp.tZ53M3 + umask 022 + cd /builddir/build/BUILD/llama-cpp-b6153-build + '[' /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT '!=' / ']' + rm -rf /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT ++ dirname /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT + mkdir -p /builddir/build/BUILD/llama-cpp-b6153-build + mkdir /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT + CFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd llama.cpp-b6153 + DESTDIR=/builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT + /usr/bin/cmake --install redhat-linux-build -- Install configuration: "Release" -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/libggml-cpu.so.b6153 -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/libggml-cpu.so -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/libggml-hip.so.b6153 -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/libggml-hip.so -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/libggml.so.b6153 -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/libggml.so -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml-cpu.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml-alloc.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml-backend.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml-blas.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml-cann.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml-cpp.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml-cuda.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml-opt.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml-metal.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml-rpc.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml-sycl.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml-vulkan.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/ggml-webgpu.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/gguf.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/libggml-base.so.b6153 -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/libggml-base.so -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/cmake/ggml/ggml-config.cmake -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/cmake/ggml/ggml-version.cmake -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-batched-bench -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-gguf-split -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-imatrix -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-bench -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-cli -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-perplexity -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-quantize -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-server -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-run -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-tokenize -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-tts -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/libmtmd.so.b6153 -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/libmtmd.so -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/mtmd.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/mtmd-helper.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-mtmd-cli -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-cvector-generator -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/llama-export-lora -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/libllama.so.b6153 -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/libllama.so -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/llama.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/include/llama-cpp.h -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/cmake/llama/llama-config.cmake -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/cmake/llama/llama-version.cmake -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/convert_hf_to_gguf.py -- Installing: /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/pkgconfig/llama.pc + rm -rf '/builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/lib64/libggml_shared.*' + rm /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/bin/convert_hf_to_gguf.py + /usr/bin/find-debuginfo -j4 --strict-build-id -m -i --build-id-seed b6153-1.fc44 --unique-debug-suffix -b6153-1.fc44.x86_64 --unique-debug-src-base llama-cpp-b6153-1.fc44.x86_64 --run-dwz --dwz-low-mem-die-limit 10000000 --dwz-max-die-limit 110000000 --remove-section .gnu.build.attributes -S debugsourcefiles.list /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153 find-debuginfo: starting Extracting debug info from 20 files DWARF-compressing 20 files dwz: ./usr/bin/llama-batched-bench-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-bench-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-cli-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-cvector-generator-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-export-lora-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-gguf-split-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-imatrix-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-mtmd-cli-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-perplexity-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-quantize-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-run-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-server-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-tokenize-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-tts-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libggml-base.so.b6153-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libggml-cpu.so.b6153-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libggml-hip.so.b6153-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libggml.so.b6153-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libllama.so.b6153-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libmtmd.so.b6153-b6153-1.fc44.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: Too few files for multifile optimization sepdebugcrcfix: Updated 0 CRC32s, 20 CRC32s did match. Creating .debug symlinks for symlinks to ELF files Copying sources found by 'debugedit -l' to /usr/src/debug/llama-cpp-b6153-1.fc44.x86_64 find-debuginfo: done + /usr/lib/rpm/check-buildroot + /usr/lib/rpm/redhat/brp-ldconfig + COMPRESS='gzip -9 -n' + COMPRESS_EXT=.gz + /usr/lib/rpm/brp-compress + /usr/lib/rpm/redhat/brp-strip-lto /usr/bin/strip + /usr/lib/rpm/check-rpaths + /usr/lib/rpm/redhat/brp-mangle-shebangs + /usr/lib/rpm/brp-remove-la-files + /usr/lib/rpm/redhat/brp-python-rpm-in-distinfo + env /usr/lib/rpm/redhat/brp-python-bytecompile '' 1 0 -j4 + /usr/lib/rpm/redhat/brp-python-hardlink + /usr/bin/add-det --brp -j4 /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT Scanned 74 directories and 374 files, processed 0 inodes, 0 modified (0 replaced + 0 rewritten), 0 unsupported format, 0 errors + /usr/bin/linkdupes --brp /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr Scanned 73 directories and 374 files, considered 368 files, read 92 files, linked 13 files, 0 errors sum of sizes of linked files: 242641 bytes Reading /builddir/build/BUILD/llama-cpp-b6153-build/SPECPARTS/rpm-debuginfo.specpart Processing files: llama-cpp-b6153-1.fc44.x86_64 Executing(%license): /bin/sh -e /var/tmp/rpm-tmp.g8xZll + umask 022 + cd /builddir/build/BUILD/llama-cpp-b6153-build + cd llama.cpp-b6153 + LICENSEDIR=/builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/share/licenses/llama-cpp + export LC_ALL=C.UTF-8 + LC_ALL=C.UTF-8 + export LICENSEDIR + /usr/bin/mkdir -p /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/share/licenses/llama-cpp + cp -pr /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/LICENSE /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/share/licenses/llama-cpp + RPM_EC=0 ++ jobs -p + exit 0 Provides: libggml-base.so.b6153()(64bit) libggml-cpu.so.b6153()(64bit) libggml-hip.so.b6153()(64bit) libggml.so.b6153()(64bit) libllama.so.b6153()(64bit) libmtmd.so.b6153()(64bit) llama-cpp = b6153-1.fc44 llama-cpp(x86-64) = b6153-1.fc44 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: glibc >= 2.42.9000-16 ld-linux-x86-64.so.2()(64bit) ld-linux-x86-64.so.2(GLIBC_2.3)(64bit) libamdhip64.so.7()(64bit) libamdhip64.so.7(hip_4.2)(64bit) libamdhip64.so.7(hip_6.0)(64bit) libc.so.6()(64bit) libc.so.6(GLIBC_2.10)(64bit) libc.so.6(GLIBC_2.14)(64bit) libc.so.6(GLIBC_2.16)(64bit) libc.so.6(GLIBC_2.17)(64bit) libc.so.6(GLIBC_2.2.5)(64bit) libc.so.6(GLIBC_2.29)(64bit) libc.so.6(GLIBC_2.3.2)(64bit) libc.so.6(GLIBC_2.3.4)(64bit) libc.so.6(GLIBC_2.32)(64bit) libc.so.6(GLIBC_2.33)(64bit) libc.so.6(GLIBC_2.34)(64bit) libc.so.6(GLIBC_2.38)(64bit) libc.so.6(GLIBC_2.4)(64bit) libc.so.6(GLIBC_2.7)(64bit) libc.so.6(GLIBC_ABI_DT_RELR)(64bit) libcurl.so.4()(64bit) libgcc_s.so.1()(64bit) libgcc_s.so.1(GCC_12.0.0)(64bit) libgcc_s.so.1(GCC_3.0)(64bit) libggml-base.so.b6153()(64bit) libggml-cpu.so.b6153()(64bit) libggml-hip.so.b6153()(64bit) libggml.so.b6153()(64bit) libhipblas.so.3()(64bit) libllama.so.b6153()(64bit) libm.so.6()(64bit) libm.so.6(GLIBC_2.2.5)(64bit) libm.so.6(GLIBC_2.27)(64bit) libm.so.6(GLIBC_2.29)(64bit) libm.so.6(GLIBC_2.43)(64bit) libmtmd.so.b6153()(64bit) libstdc++.so.6()(64bit) libstdc++.so.6(CXXABI_1.3)(64bit) libstdc++.so.6(CXXABI_1.3.11)(64bit) libstdc++.so.6(CXXABI_1.3.13)(64bit) libstdc++.so.6(CXXABI_1.3.2)(64bit) libstdc++.so.6(CXXABI_1.3.3)(64bit) libstdc++.so.6(CXXABI_1.3.5)(64bit) libstdc++.so.6(CXXABI_1.3.7)(64bit) libstdc++.so.6(CXXABI_1.3.9)(64bit) libstdc++.so.6(GLIBCXX_3.4)(64bit) libstdc++.so.6(GLIBCXX_3.4.11)(64bit) libstdc++.so.6(GLIBCXX_3.4.14)(64bit) libstdc++.so.6(GLIBCXX_3.4.15)(64bit) libstdc++.so.6(GLIBCXX_3.4.17)(64bit) libstdc++.so.6(GLIBCXX_3.4.18)(64bit) libstdc++.so.6(GLIBCXX_3.4.19)(64bit) libstdc++.so.6(GLIBCXX_3.4.20)(64bit) libstdc++.so.6(GLIBCXX_3.4.21)(64bit) libstdc++.so.6(GLIBCXX_3.4.22)(64bit) libstdc++.so.6(GLIBCXX_3.4.25)(64bit) libstdc++.so.6(GLIBCXX_3.4.26)(64bit) libstdc++.so.6(GLIBCXX_3.4.29)(64bit) libstdc++.so.6(GLIBCXX_3.4.30)(64bit) libstdc++.so.6(GLIBCXX_3.4.32)(64bit) libstdc++.so.6(GLIBCXX_3.4.9)(64bit) Recommends: numactl Processing files: llama-cpp-devel-b6153-1.fc44.x86_64 Executing(%doc): /bin/sh -e /var/tmp/rpm-tmp.OxRXwX + umask 022 + cd /builddir/build/BUILD/llama-cpp-b6153-build + cd llama.cpp-b6153 + DOCDIR=/builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/share/doc/llama-cpp-devel + export LC_ALL=C.UTF-8 + LC_ALL=C.UTF-8 + export DOCDIR + /usr/bin/mkdir -p /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/share/doc/llama-cpp-devel + cp -pr /builddir/build/BUILD/llama-cpp-b6153-build/llama.cpp-b6153/README.md /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT/usr/share/doc/llama-cpp-devel + RPM_EC=0 ++ jobs -p + exit 0 Provides: cmake(ggml) cmake(llama) llama-cpp-devel = b6153-1.fc44 llama-cpp-devel(x86-64) = b6153-1.fc44 pkgconfig(llama) = 0.0.0 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PartialHardlinkSets) <= 4.0.4-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: /usr/bin/pkg-config cmake-filesystem(x86-64) libggml-base.so.b6153()(64bit) libggml-cpu.so.b6153()(64bit) libggml-hip.so.b6153()(64bit) libggml.so.b6153()(64bit) libllama.so.b6153()(64bit) libmtmd.so.b6153()(64bit) Processing files: llama-cpp-debugsource-b6153-1.fc44.x86_64 Provides: llama-cpp-debugsource = b6153-1.fc44 llama-cpp-debugsource(x86-64) = b6153-1.fc44 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Processing files: llama-cpp-debuginfo-b6153-1.fc44.x86_64 Provides: debuginfo(build-id) = 0d6fbb9e1b0f65a6faa723182175b34b4f932c89 debuginfo(build-id) = 19d72a123de36c5d99ad0115bd80af3f543d2dbd debuginfo(build-id) = 1ba40eec7c99a2064ff55e2b577830983084abe0 debuginfo(build-id) = 2242c4edc6b5f6099a9353e8b277cc63b905e2b3 debuginfo(build-id) = 2b2224dfd7ea2e4f22ef315fc9c8bf9e7173782f debuginfo(build-id) = 357a7e17c55113cb68dd706db2c935f45cbde215 debuginfo(build-id) = 4fd2477e720046aa7fff825f3ac432d513e226fb debuginfo(build-id) = 52113ba213ceccd45f160cb91e20522d1da6abc6 debuginfo(build-id) = 543cce647b0446de52bc54593e8ba52bec0e7b32 debuginfo(build-id) = 579af5d38e7810fd1899a430c24c0d4d88fe6bd1 debuginfo(build-id) = 59449fc6fb8bec98a87ef05261924fc787dbb0ac debuginfo(build-id) = 740c51e06c5d36f43689b53a2ed7bd6937d6180a debuginfo(build-id) = 75cd1d6b9b3f5f5aa1950c846ecdf2ae3ac0e7d8 debuginfo(build-id) = 821bfa6304e4590ac348b2ae1100472264f62d96 debuginfo(build-id) = 87c4125b55bb03ef74aac2eeb416967237593615 debuginfo(build-id) = 934ec3ea4f74c1a9445e03909c5cb13d0a423a03 debuginfo(build-id) = d65530cd9ae82c98fb604a4b153fbcf3884976c3 debuginfo(build-id) = db8f9acfec5c2bff5be058c8f585e9ddfafab9e2 debuginfo(build-id) = e68486d8b4abf726eeaf73bedf63b9a47553ecde debuginfo(build-id) = eaadc3a1ca3a3083f5b2b9319fa4c78e2f995d23 libggml-base.so.b6153-b6153-1.fc44.x86_64.debug()(64bit) libggml-cpu.so.b6153-b6153-1.fc44.x86_64.debug()(64bit) libggml-hip.so.b6153-b6153-1.fc44.x86_64.debug()(64bit) libggml.so.b6153-b6153-1.fc44.x86_64.debug()(64bit) libllama.so.b6153-b6153-1.fc44.x86_64.debug()(64bit) libmtmd.so.b6153-b6153-1.fc44.x86_64.debug()(64bit) llama-cpp-debuginfo = b6153-1.fc44 llama-cpp-debuginfo(x86-64) = b6153-1.fc44 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Recommends: llama-cpp-debugsource(x86-64) = b6153-1.fc44 Checking for unpackaged file(s): /usr/lib/rpm/check-files /builddir/build/BUILD/llama-cpp-b6153-build/BUILDROOT Wrote: /builddir/build/RPMS/llama-cpp-devel-b6153-1.fc44.x86_64.rpm Wrote: /builddir/build/RPMS/llama-cpp-debugsource-b6153-1.fc44.x86_64.rpm Wrote: /builddir/build/RPMS/llama-cpp-debuginfo-b6153-1.fc44.x86_64.rpm Wrote: /builddir/build/RPMS/llama-cpp-b6153-1.fc44.x86_64.rpm Executing(rmbuild): /bin/sh -e /var/tmp/rpm-tmp.a5ViC0 + umask 022 + cd /builddir/build/BUILD/llama-cpp-b6153-build + test -d /builddir/build/BUILD/llama-cpp-b6153-build + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w /builddir/build/BUILD/llama-cpp-b6153-build + rm -rf /builddir/build/BUILD/llama-cpp-b6153-build + RPM_EC=0 ++ jobs -p + exit 0 Finish: rpmbuild llama-cpp-b6153-1.fc44.src.rpm Finish: build phase for llama-cpp-b6153-1.fc44.src.rpm INFO: chroot_scan: 1 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/fedora-rawhide-x86_64-1766268696.323769/root/var/log/dnf5.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names INFO: Done(/var/lib/copr-rpmbuild/results/llama-cpp-b6153-1.fc44.src.rpm) Config(child) 94 minutes 9 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot Finish: run Running RPMResults tool Package info: { "packages": [ { "name": "llama-cpp", "epoch": null, "version": "b6153", "release": "1.fc44", "arch": "x86_64" }, { "name": "llama-cpp-debuginfo", "epoch": null, "version": "b6153", "release": "1.fc44", "arch": "x86_64" }, { "name": "llama-cpp-devel", "epoch": null, "version": "b6153", "release": "1.fc44", "arch": "x86_64" }, { "name": "llama-cpp", "epoch": null, "version": "b6153", "release": "1.fc44", "arch": "src" }, { "name": "llama-cpp-debugsource", "epoch": null, "version": "b6153", "release": "1.fc44", "arch": "x86_64" } ] } RPMResults finished